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PREFACE 


With this volume, the Annual Review of Sociology completes its first 15 years 
of continuous publication. While this may not seem a long time, 15 years is an 
accomplishment for any edited series. 

The index of cumulative chapters (included at the end of this volume) 
provides about as accurate a guide as can be found to both the continuing and 
the varied and changing interests of sociologists. General topics such as 
justice, inequality, mobility, migration, social movements and social roles 
recur throughout the period, but there is evidence for increased interest among 
sociologists in studying the state, life course processes, gender, culture, and 
the environment. 

Sociology has also become less provincial and more connected with other 
disciplines over the 15-year period. More attention is paid to the contributions 
of—and more articles are authored or co-authored by—anthropologists, eco- 
nomists, historians, political scientists, psychologists, and legal scholars. 
Also, non-US authors appear more frequently, and more chapters attend to 
non-Western experiences. 

Yet another interesting development is that subfields within sociology are 
beginning to interact in productive ways. In recent years, for example, we 
have had essays reviewing demographic processes within organizations, the 
effect of business interests on urban power structures, and in the current 
volume, the role of the state in influencing life course processes. We are 
beginning to understand that complex social structures are the result of 
interaction as well as the main effects of key variables. 

Our colleagues inform. us that the Annual Review of Sociology is in- 
creasingly an important tool in their teaching and research—the publication 
' they seek out when they are preparing their lectures or conducting a literature 
review for a research project. Best of all, we think, it is the reference to which 
graduate students turn as they prepare for their prelims or develop their 
dissertation plans. These are our targeted, favored audiences. If we can put 
intelligent reviews of the latest and best sociological work in the hands of 
those who are teaching or who are themselves preparing to become the next 
generation of scholars, we will have performed an important service for our 
discipline and, accordingly, for our societies. 
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SOME REFLECTIONS ON THE 
GOLDEN AGE OF | 
INTERDISCIPLINARY SOCIAL 
PSYCHOLOGY 


William H. Sewell 


Department of Sociology, University of Wisconsin, Madison, Wisconsin 53706 


Abstract 


In the 25 years or so that began with World War I, there was a great wave of 
enthusiasm for interdisciplinary social psychology which resulted in the 
establishment of interdisciplinary social psychology training and research 
programs in some of the major universities in the United States. By the 
mid-1960s however, this seeming, Golden Age had largely vanished. This 
article, by one of the participants in this movement, is devoted to an elabora- 
tion of how this Golden Age came about and the forces that led to its demise. 
Its origins are traced to the World War II experiences of social psychologists 
in interdisciplinary research on the adjustments of the American soldier under 
the leadership of Samuel Stoffer and with Rensis Likert on the US strategic 
bombing surveys in Germany and Japan. Many of the participants in this 
research were greatly impressed by the fruitfulness of interdisciplinary col- 
laboration and were determined to establish interdisciplinary social psycholo- 
gy programs on their return to their universities. Several of these programs 
were very successful for a number of years, especially those at Harvard and 
Michigan, but failed to survive and become integrated into the institutional 
structure of the American university. The reasons for their failures are com- 
plex but at least four factors seem to have been important. First, the threat of 
these programs to the traditional departmental structure of the university— 
particularly in light of the relatively weak position of the social sciences in 
that structure. Second, the lack of adequate and appropriate funding from 
either university or federal sources. Third, the lack of a major breakthrough in 
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social psychological theory. Fourth, advancements in research methods did 
not produce greatly increased understanding of social psychological phe- 
nomena. These factors are examined and ccntrasted with the situation in the 
natural sciences, particularly with moleculer biology. 


Introduction 


In a perceptive article, “The Three Faces cf Social Psychology,” James S. 
House (1977) pointed out that in the 25 years or so that began with World War 
II, there was a great wave of enthusiasm for interdisciplinary social psycholo- 
gy. This led to the establishment of several s_gnificant interdisciplinary social 
psychology training programs and researck centers in some of the major 
universities in the United States. By the mit-1960s, however, this seeming 
Golden Age of interdisciplinary social psychology had largely vanished. By 
the mid-1970s, it had been almost completely replaced by three separate and 
largely isolated divisions of social psychology: psychological social psycholo- 
gy now focuses on individual psychologicel processes as related to social 
stimuli, emphasizing the use of laboratory experimental methods; symbolic 
interactionism concentrates on face-to-face social interaction processes, using 
participant observation and informal interviewing in natural settings; and 
psychological sociology focuses on the reciprocal relation between social 
structure and individual social psychological behavior, relying mainly on 
survey methods. House further asserted that this fractionation of social psy- 
chology grew out-of the institutional and intelectual contexts in which social 
psychology originally developed, that the th-ee factions have grown further 
apart over the last two decades, and that there was great need for more 
interaction between them, if a vital and well-zounded social psychclogy were 
to develop. 

For the most part I agree with House’s formulations and conclusions, 
although I still prefer to describe what most of us do with the traditional label, 
social structure and personality, rather thar with his term, psychological 
sociology. Like Sheldon Stryker (1987), I se2 a somewhat less clear distinc- 
tion between the present stance of symbolic interactionism and social struc- 
ture and personality than House did a decade ago. This is particularly true 
now. that many symbolic interactionists are using formal observation, sample 
surveys, and multivariate analysis in their research. 

This brief review of House’s article serves as background for my own 
reflections on what some call the Golden Age of interdisciplinary social 
psychology. I wish to elaborate on how it came about and to describe the 
forces that led to its demise. I agree with House that the intellectual and 
institutional contexts in which each faction developed probably predetermined 
its return to its original disciplinary moorings once the interdisciplinary 
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arrangements faltered. I wish to reflect on the circumstances that may have 
contributed to the failure of these programs to become part of the institutional 
structure of our universities, while several post-war interdisciplinary pro- 
grams in the natural sciences succeeded. 


Background 


I am neither a qualified historian of science nor a sociologist of knowledge, 
but I was one of the many actors in the movement and I participated in almost 
every aspect of it, its successes and its failures. Thus, I feel emboldened to 
share my reflections on it. Like most other participants, I had completed my 
graduate training in sociology before World War U, with a major interest but 
inadequate training in social psychology. There were few places where one 
could obtain much in the way of training in social psychology in the mid- 
1930s, and Minnesota, where I did my PhD in sociology, was not one of 
them.’ By the time I was called to service in World War II as a reserve officer 
in the US Navy, I was already a fairly well-established sociologist. I had read 
widely in social psychology and had done research and teaching in the field. 
On entering active military service, I was assigned to the staff of the Research 
Division of the National Headquarters of Selective Service, where along with 
other social scientists I did research on civilian and military manpower.” 
During this period, through contacts with Samuel A. Stouffer and members of 
his staff, I became well acquainted with the interdisciplinary research pro- 
gram of the Information and Education Division of the War Department, most 
of which involved studies of the adjustment of soldiers to military life during 
World War II. Much of this research was published, under Stouffer’s leader- 
ship, in the famous four-volume work The American Soldier (1949). I also 
became acquainted with Rensis Likert and several of his colleagues who 
directed the Program Surveys Division of the Department of Agriculture 
which had been doing social psychological studies of the civilian population 
for several departments of the Government during the war years. 

After the surrender of Germany, Likert asked me to join a group that was 


‘At that time the leading centers for social psychology training were Chicago, Columbia, and 
Harvard, but even at these institutions the offerings were not extensive. At Minnesota I had 
reading courses with Clifford Kirkpatrick and sat in on a course in social psychology in the 
psychology department. This course was devoted largely to group differences in ability and 
attitudes and gave some attention to collective behavior. Fortunately, as an undergraduate, I had 
had courses in sociology and philosophy at Michigan State in which I had read much of Dewey 
and Cooley and some of Mead-——at that time Mind, Self and Society (1934) had not yet been 
published. 

2Others involved in this research were Kenneth McGill, Raymond V. Bowers, C. Amold 
Anderson, Harold Faulk, Robert N. Ford, J. Mapheus Smith, and Louis Levine. 
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making preliminary plans for a study of the influence of strategic bombing on 
Japanese civilian morale. (Likert had directed a similar study in Germany; see 
US Strategic Bombing Survey, 1946). This group included some members of 
the team that had conducted the German survey and several of the social 
scientists who were to conduct the survey in. Japan. We drew up preliminary 
plans for the Japanese survey, including a clear conceptualization of the aims 
of the survey, a list of the major components of morale, and a series of 
questions designed to elicit these. 

Within days after the surrender, the interdisciplinary team of psychologists, 
sociologists, anthropologists, political scientists, a psychiatrist, and sampling 
statisticians, who were to carry out the study had assembled in Tokyo.’ We 
immediately began to review the purposes and design of the survey and made 
many important revisions in both the concep-ual guides for the study and the 
content of the survey instrument. We then pretested the interview schedule 
on Japanese civilians, using Japanese-American interviewers. These inter- 
viewers had participated in many of our meetings and were well acquainted 
with the purposes of the research. The survey directors and the interviewers 
then participated in the final revision of the interview schedule. 

Meanwhile, our sampling experts had designed and drawn a probability 
sample of the Japanese adult civilian population consisting of approximately 
3000 persons. We then took our teams of interviewers into the field and 
completed the interviewing in a ‘period of <hree more months. In another 
month or so, after returning to Washington DC, we had developed a coding 
scheme for the interviews, coded the materials, and completed the statistical 
processing of the data. By then several of our number had been released from 
service and returned to their academic posts. Those of us who remained, with 
assistance from some of our departed colleagues, wrote the final report which 
was then sent to the printers. All of this was accomplished within less than a 
year after our arrival in Japan. The report wes published by the Government 
Printing Office in 1947 (US Strategic Bomting Survey, 1947). 

Throughout this endeavor I was very much impressed with the fruitfulness 
of interdisciplinary collaboration among bright and willing social scientists. 
In general the most innovative and insightful ideas were generated as a result 
of group discussions in which little regard was paid to the disciplinary origin 
of the idea. I was greatly impressed also with the ability of an interdisciplinary 
team to mount a study of this complexity and to move it to completion so 


>The group, included David Aberle, Conrad Arensberg, Jules Henry, and Fredrick Hulse 
(anthropologists); Donald Adams, Edgerton Ballachey, and Horace English (psychologists); 
Raymond Bowers, Burton Fisher, and William Sewell (sociologists); Morris Hansen and Harold 
Nisselson (statisticians); David Truman and Harold Nissen (political scientists), and Alexander 
Leighton (psychiatrist). 
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expeditiously.* My colleagues on the Bombing Survey, as well as those who 
had participated in other wartime interdisciplinary social psychology research 
projects, were equally impressed with their experiences and were determined 
to promote interdisciplinary training and research programs in social psychol- 
ogy on return to academic life. 

Moreover, the private foundations, especially Ford, Rockefeller, Carnegie, 
and Sage, along with the Social Science Research Council, were stressing 
interdisciplinary social psychology. The Office of Naval Research, the 
National Institutes of Health, and later the National Science Foundation also 
were supportive of interdisciplinary research and training programs in social 
` psychology. I served on and was chairman of several research grant and 
training committees during the period of expansion of interdisciplinary social 
psychology. Through this activity I came to know most of the leaders in this 
movement, and throughout the period I was involved with them in the 
promotion of interdisciplinary social psychology on the national level, as well 
as with others at the University of Wisconsin.° 

As a result of all of this enthusiasm, activity, and support, interdisciplinary 
programs for graduate training were developed at Michigan, Harvard, Yale, 
Cornell, Berkeley, Columbia, Minnesota, Wisconsin, and other leading uni- 
versities. In addition the National Opinion Research Center was moved to the 
University of Chicago, with a broadly expanded program under the direction 
of Clyde Hart; a new national research center, the Institute for Social Re- 
search, was established at the University of Michigan under the direction of 
Rensis Likert; and the Bureau of Applied Social Research was established 
at Columbia under the leadership of Paul F. Lazarsfeld. More locally ori- 
ented survey research centers were developed at Harvard, Yale, Princeton, 
Berkeley, UCLA, Illinois, Minnesota, and Wisconsin, to mention only the 


4] wish it were possible to produce actual illustrative examples.of how the work of the team 
was influenced by its interdisciplinary composition. I am sure that it was, but my memory of the 
instances is no longer reliable. I took no notes on our meetings and conferences, nor do I 
remember that anyone else did except Edgerton Ballachey. His notes on the Tokyo meetings, and 
those of David Krech on the earlier meetings in Washington, formed the basis for their syllabus, 
A Case Study of a Social Survey, (1948), but they do not discuss the disciplinary sources of ideas, 
the definitions of concepts, the hypotheses tested, or the analytic strategies of the study. Again, 
what I remember most clearly is that some of the best suggestions came from persons who were 
not directly identified with the discipline with which one would normally associate the idea. This 
is not too surprising because all of us had a strong commitment to social psychology regardless of 
our disciplinary identifications. 

>Among the national leaders not otherwise mentioned in this paper were: Gardner Murphy, 
Richard Crutchfield, Clyde Coombs, Ernest Hilgard, Charles Osgood, Harold Kelley, Goodwin 
Watson, Muzafer Sherif, Urie Bronfenbrenner, M. Brewster Smith, David Riesman, Leonard 
Cottrell, Robin Williams, Amold Rose, Fred Strodtbeck, Angus Campbell, Herbert Simon, 
Ralph Linton, Fredrick Redlich, August Hollingshead, Warren Dunham, and Robert Faris. 
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more prominent ones. The primary commitment of these centers was 
to interdisciplinary research on social psychological topics using sample 
survey research methods. The Armed Services established similar research 
centers to investigate problems related to their military mission. During this 
period too the interdisciplinary research program in social psychology was 
developed in the Laboratory of Socioenviconmental Studies of the Nation- 
al Institute of Health under the direction of John Clausen and, later, Mel- 
vin Kohn. 

There certainly was no lack of interest in the interdisciplinery graduate 
training programs on the part of social psychologists and graduate students. 
Social psychology was a challenging intellectual field and students were 
anxious to learn more about it, including what other disciplines than their own 
had to contribute to its theory and methods. Moreover, there was a backlog of 
mature graduate students whose education had been interrupted by military 
service and who could qualify for financial support for their graduate training 
under the GI Bill. Still others could be supported by training grants from the 
National Institute of Mental Health, National Institute of General Medicine, 
the National Science Foundation, and other agencies interested in increasing 
the supply of persons trained in social psychology. Thus, it seemed that all 
conditions were right for sustained growth of a new interdisciplinary field of 
social psychology. 

For a decade or so great progress was made, particularly in the in- 
terdisciplinary training program at Michigan, under the leadership of Theo- 
dore Newcomb. The faculty included Angas Campbell, Dorwin Cartright, 
J. R. P. French, William Gamson, Daniel Katz, Robert Kahn, Herbert Kell- 
man, Helen Peak, Albert Reiss, Guy E. Swanson, and Howard Schuman. 
At Harvard, the new Department of Social Relations was headed by Tal- 
cott Parsons, with a faculty that included Gordon Allport, R. Freed Bales, 
George Homans, Alex Inkeles, Clyde anc Florence Kluckhohn, Gardner 
Lindzey, Frederick Mosteller, Richard Sclomon, and Samuel Stouffer.’ 
The growth was less spectacular in other universities but was by no means 
insignificant. With such a good start why did the interdisciplinary pro- 
grams in social psychology all but vanish oy the late 1960s, without ever 
becoming established in the institutional structure of American univer- 
sities? 


` 


*For information on the work of these and related -esearch centers see Bowers (1967). 

7To my knowledge only the Harvard program was set up as a separate department. The usual 
pattern was the one followed at Michigan where the socal psychology program resulted from the 
joint sponsorship at the departments of sociology and asychology; it never had the status of a 
separate department. I suspect the separate department at Harvard resulted not only from Talcott 
Parson’s interest in social relations but also from his desire to escape Pitirim Sorokin’s adminis- 
trative control. , 
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Why Did Interdisciplinary Programs In Social Psychology 
Fail? : 


The reasons for the failure of interdisciplinary programs in social psychology 
are complex and not entirely apparent. I believe one reason is the traditional 
institutional structure of American universities and the place of the social 
sciences in that structure. Another factor closely related to this is the system 
for funding science that has developed and become institutionalized in the 
United States and the unfavorable position of the social sciences in this 
system.® Other reasons may be found within social psychology itself, particu- 
larly in the condition of social psychological theory and methods. 


The Threat to the Departmental Structure. 


I turn first to the traditional institutional structure of the American university 
and the relatively weak position of the social sciences there. The physical and 
biological sciences, in both their pure and applied branches, are in a position 
superior to the social sciences and humanities in most of our universities. This 
is true with respect to the funds allocated to research, to buildings, to 
equipment, and to salaries, but particularly to new ventures such as research 
centers and training programs. Thus, the existing social science departments 
have to defend their turf in the face of new interdisciplinary programs that 
might threaten their claim on the universities resources. This is much less true 
in the natural sciences where more ample funds are available from both local 
and national sources. Thus, social science departments tend to be much less 
supportive of interdisciplinary programs, unless additional funds for them can 
be brought in from the outside. This is particularly so when the program is 
likely to require faculty, scholarships, space, equipment, and operating funds, 
always in short supply, that may draw faculty and students away from the 
parent departments. To be sure several universities were willing to give 
limited support to interdisciplinary training programs in social psychology, 
but for the most part the faculty were part-time in the program and were 
budgeted to their original departments. Funds for subsidizing graduate stu- 
dents, faculty research, secretarial and clerical staff were expected to come 
from outside grants. Federal funds to meet these costs, although available, 
were usually inadequate and were granted for a relatively short period— 
usually three to five years—-with no assurance they would be renewed. Given 
these conditions it is not surprising that the parent departments found in- 
terdisciplinary social psychology programs threatening. 

Wisconsin provides a classic example of the point I have been making. 
Early on, I obtained funds from the Social Science Research Council to set up 
a faculty seminar made up of psychologists, anthropologists, and sociologists 
to draw up plans for a graduate interdisciplinary training program in social 


®For further discussion of this topic see Sewell (1988). 
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psychology. We met for several months and developed a program consisting 
mainly of courses in social psychology already being taught in the de- 
partments, plus two new seminars: one on carrent social psychological theory 
and the other on current research methods in social psychology. The dean of 
our college supported the plan, subject to the approval of the departments 
involved, with the understanding that the departments would provide the 
faculty from their current budgets. It was further assumed that the group 
would seek outside funds for the subsidy of graduate students and for other 
requisite costs. When the plan was presented to the departments, neither 
sociology nor psychology would approve of our request for a joint major in 
social psychology. The best either department would settle for was an in- 
terdepartmental minor, with psychology req tiring that students who were not 
psychology majors take the psychology pros=minar, and sociology stipulating 
similar requirements for psychology students. The Graduate School approved 
of these arrangements, but the graduate students did not find these require- 
ments attractive and within a few years the program ceased to exist. This did 
not mean that thereafter no interdisciplinary training in social psychology 
could be found at Wisconsin; the sociologica. social psychologists encouraged 
their students to take courses from psychobogical social psychologists, and 
vice versa, but nothing like a true interdiscip.inary program emerged. I do not 
claim that the Wisconsin experience was typical of other programs, but I do 
know that several others suffered from less than adequate support by their 
departments and deans. In some institutions interdisciplinary programs pros- 
pered only so long as their enthusiastic and powerful founders participated in 
the program and strongly supported it in their departments. When they were 
replaced, their successors tended to lack the enthusiasm and often the power 
and organizational skills of the founders. Conflicts arose; the departments and 
college administrations withdrew their support, and the programs were soon 
abandoned. 


Lack of Adequate Funding 


Another factor in the decline of interdisciplirary social psychology programs 
was that they never received adequate funding from federal sources. This may 
seem paradoxical because it was during the period of their ascendency that 
funding for social science research and training became institutionalized as a 
part of the program for the support of science in the United States, particularly 
in the National Institutes of Health and later in the National Science 
Foundation.” Social psychology was especia_ly favored in the research grant 
and training programs of the National Institute of Mental Health. However, 


°For a brief discussion of the early development and importance of these sources of support, 
see Sewell (1988). 
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the funds going to the support of social sciences in these agencies never were 
more than 10% of their research budgets, and social psychology got far less 
than other branches of psychology and less than some other subfields of 
sociology. During this period funds became available for research and train- 
ing in medical sociology, social problems, urban problems, juvenile de- 
linquency, substance abuse, and aging. All of these involved social psycho- 
logical research but still were competitive with programs for interdisciplinary 
social psychology research and training. 

In any event, the major source of funds available for social psychology was 
NIMH. These funds had to be justified on the basis of mental health rele- 
vance, were modest in amount, limited in duration, provided for only a small 
number of research assistants or trainees, did not provide facilities, and 
usually supported only a limited portion of the salary of the principal in- 
vestigator or director of the program. The consequence was that these pro- 
grams were inadequately supported by either the universities or the federal 
agencies. This is in rather sharp contrast to the ever-increasing funds available 
to interdisciplinary research and training programs in the natural sciences 
during this period. © 

Unfortunately, the national survey research centers at Michigan and Chica- 
go were somewhat underused as a sourcé of interdisciplinary research training 
in social psychology. These not-for-profit organizations were only loosely 
connected with the universities, had their own staff, received limited financial 
assistance from the universities, and had to raise their own funds by doing 
contract work for government and private business. I do not mean to imply 
that they provided no support for social psychology programs. A limited 
number of their members participated in training programs, and both orga- 
nizations provided part-time employment for a number of graduate students. 
They also made their research facilities available to faculty members who 
wished to subcontract with them for data gathering, data processing, and 
related services. Generally this was possible only when faculty members had 
outside grants for these purposes. Much the same situation held at other 
university sponsored survey research centers with the exception of Colum- 
bia’s Bureau of Applied Social Research, where graduate students received 
coordinated training in Lazarsfeld’s research methods seminar along with 
first-hand experience on the Bureau’s ongoing social research projects. 


Modest Advances in Theory 


In speculating about the fate of interdisciplinary social psychology, I must 
point out that there have been no powerful theoretical breakthroughs that 
might have served as a stimulus to exciting new theoretical developments or 
new research areas during this period, or for that matter since. Advances in 
social psychological theory did occur, but they were modest. Although some 
codification took place, nothing approaching a unified body of social psycho- 
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logical theory emerged. Rather, there were improvements in somewhat iso- 
lated bodies of special social psychological -heories, such as role theory, field 
theory, attitude theory, socialization theory. theory of interpersonal relations, 
communication theory, theory of collective behavior, and theory of small 
group processes. Perhaps the most exciting advances were made in our 
knowledge of small group processes, with fruitful work being done on group 
structure, cohesiveness, communication flow, leadership, productivity, de- 
viance, and the construction of social reality. Unfortunately most of the 
bodies of special theories mentioned above “to paraphrase Robert K. Merton; 
1949 : 85-87) consisted of general orientaticns toward problems and types of 
variables to be taken into account, rather than verifiable statements of rela- 
tionships between sets of specified variables. There was little or no consolida- 
tion of these special theories into a general conceptual scheme for social 
psychology. The fate of social psychology ir this regard was no ditferent from 
that of the social sciences generally. In fact could be argued that none of the 
social sciences made spectacular progress in developing general theory during 
these years. 

What did take place was a great burst of research activity on a large number 
of social psychological topics, often with a view toward shedding light on 
problems of social behavior, rather than toward theory construction or testing. 
This is clearly reflected in the articles selected for the influential book, 
Readings in Social Psychology, edited by Theodore Newcomb and Eugene 
Hartley in 1947 and revised in 1952 and 1957. This book was sponsored by 
the Society .for the Psychological Study of Social Issues. Its editors did not 
attempt to provide an overall framework for social psychology; instead they 
stressed three requirements for research in sdcial psychology: It must adhere 
to rigorous canons of scientific procedure; it must draw hypotheses from the 
relevant psychological and social sciences; and it must bring these hypotheses 
to bear on systematic research on problems o7 human importance (Likert 1947 
V). This final requirement characterized much of the research done during 
this period. It must be remembered that a great volume of research was done, 
enough to fill professional journals in the field and, for that matter, hundreds 
of pages in the more general journals of the various social sciences. Most of 
the pages in the five volumes of The Handbcok of Social Psycholegy (1969), 
edited by Gardner Lindzey and Elliot Aronson, are devoted to summarizing 
this research record. But only a small fraction of this research resulted from 
interdisciplinary efforts. 

Unfortunately, little of this outpouring cf research resulted in powerful 
ideas that could stimulate further development of theory or research in social 
psychology. Rather, with few exceptions, the explanatory power of the 
theories and models in social psychology remained quite modest—often 
providing only small, though statistically significant, results. This is not the 
stuff that makes for a stimulating new interdisciplinary field; and it certainly 
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does not command the long-term, high level of financial support that in- 
terdisciplinary programs need to be successful. 


Advances in Research Methods 


During this period of great research activity much effort was devoted to the 
improvement of research methods, particularly in sampling, interviewing, 
questionnaire construction, index and scale development, observational tech- 
niques, and statistical methods for the analysis of survey data. Because of the 
rapid adoption and use of sample survey methods during and after the war, the 
government and the public were raising questions about the adequacy and 
dependability of existing methods of sampling, interviewing, and data analy- 
sis. Consequently, the National Research Council and the Social Science 
Research Council jointly sponsored a committee, under the chairmanship of 
Samuel A. Stouffer, to investigate these questions. The committee in turn 
commissioned a study on interviewing under the direction of Herbert H. 
Hyman and one on sampling under the direction of Fredrick F. Stephan and 
Phillip McCarthy. The results were Hyman’s Interviewing in Social Research 
(1954) and Stephan and McCarthy’s Sampling Opinions (1958). Hyman’s 
book not only brought together what was then known about interviewing, it 
also reported on a series of careful experimental and observational studies of 
sources of error in interviews, and their control. This book had a great 
influence on the work of survey agencies and on the teaching of survey 
research methods. Hyman also produced another influential book, Survey 
Design and Analysis (1955), that presented a series of detailed case studies of 
problems encountered in social research. This book grew out of Paul F. 
Lazarsfeld’s well-known project at Columbia, designed to produce materials 
suitable for advanced training in social research. An equally important book 
from the Columbia project was Paul F. Lazarsfeld and Morris Rosenberg, The 
Language of Social Research (1955), which emphasized the use of partialling 
to contro] for the influence of intervening variables in studying causal rela- 
tionships, and of contextual analysis to separate individual and group effects 
(see also Kendall & Lazarsfeld 1950 and Lazarsfeld & Menzel 1961). 

The Stephan & McCarthy book (1958) describes the relationship between 
sampling and other components of survey design, the problems raised when 
methods do not conform to the underlying mathematical theory, and finally, 
the problems encountered in actually designing a sample survey and putting it 
into operation. This book was by no means a primer; it was quite influential in 
survey research operations and was widely used in survey research methods 
courses. Other important books on sampling during the period were William 
Edwards Deming, Some Theory of Sampling, (1950) and Morris H. Hansen, 
William Hurwitz, and William Madow, Sample Survey Methods and Theory, 
(1958). 

Mention should also be made of the book by Marie Jahoda, Morton 
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Deutsch, and Stewart Cook, Research Methods in Social Relations (1951), 
which covered research design, observational techniques, survey methods, 
content analysis, measurement, and data analysis. This book, sponsored by 
the Society for the Psychological Issues, was widely used in introductory 
courses in research methods in social psychology. A more advanced text 
covering much the same subject matter was Research Methods in Social 
Sciences (1953), edited by two prominent social psychologists, Leon Festing- 
er and Daniel Katz. A book used by sociologists teaching research methods in 
social psychology was Sociological Studies in Scale Analysis (1954) by Riley 
et al. Another important book is R. Freed Bales’s Interaction Process Analy- 
sis (1950) which provided sociological social psychologists with a system that 
enabled them to observe and rate the behavior of members of small groups. 
This system was widely adopted by yourger sociologists and produced a 
generation of social psychologists who continue to work on important prob- 
lems of individual and group behavior. 

Considerable progress was also made in tae measurement of social psycho- 
logical variables during this period. Of course, even before the period began, 
L. L. Thurstone (1928), Rensis Likert (1932), and others had developed 
useful techniques for scaling attitudes, opinions, and similar social psycho- 
logical constructs. But early in this period (during World War II) Louis 
Guttman (1944, 1950) developed Scalegram Analysis for determining rank 
order. This technique came to be known as Guttman Scaling. Scalegram 
Analysis, which is easy to accomplish and produces readily understandable 
results, was widely adopted. It doubtless was a great stimulus to research on 
attitudes and to studies of attitude change, which was a popular topic during 
this time of great concern with intergroup relations. Guttman Scaling soon 
replaced the earlier techniques for scale and test construction and was used to 
measure a wide range of social science variables. Mention must also be made 
of Paul F. Lazarsfeld’s development of Lacent Structure Method (1950) by 
which the manifest relations between any two items in a questionnaire can be 
accounted for by a simple set of latent classes, and only by this set. This was 
an important contribution to scaling theory although it did not gain wide- 
spread use by social psychologists. +° 


‘©PDuring the later part of this period social psychologists began to develop scales and indexes 
by the use of factor analytic methods. Computer programs were developed to factor analyze and 
assign factor weights to a large number of items with g-eat speed; this made scaling a quick and 
cheap process. Scales were produced to measure almost any social psychological variable that 
anyone could wish for. Unfortunately, according to Otis Dudley Duncan (1984a: 119-155) who 
provides a number of examples, most of these scales © not meet even minimum measurement 
requirements and hence they produce misleading results The matter is made still worse when the 
numbers produced by these scales are subjected to comp.ex statistical analysis. Duncan concludes 
that until we are ready to do the hard thinking and caref.l analysis that the methods developed by 
Georg Rasch (1968; Perline, et al 1979; Duncan 1984b] require, there is little hope for adequate 
measurement scales in social psychology or other soczl sciences for that matter. 
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Probably the greatest area of advance in this period was in the use of 
computer technology and methods. We started the period using counting 
sorters for most of our research. I remember the hours-that my wife and I 
spent in 1938 feeding IBM cards into a counting sorter to get the numbers I 
used in the analysis of the 123 items ‘from which I selected the 36 most 
diagnostic ones that finally comprised the farm family socioeconomic status 
scale (Sewell 1940). By the early 1950s, however, we had computers that, 
although crude by present day standards, had sufficient speed and storage 
capacity to-enable us to do some quite complicated multivariate statistical 
analysis. For example, my colleagues and I were able to do a factor analysis 
of a set of 38 child-training practices to test hypotheses concerning the 
psychoanalytic claim that the mother’s child-training practices reflect her 
unconscious acceptance or rejection of her child (Sewell et al 1955). In- 
cidentally, the results failed to confirm the hypothesis. The computer proved 
to be very useful to social psychologists in multivariate cross-tabular analysis 
based on large samples, such as those my colleagues and I used to parse out 
the influence of social background variables on educational and occupational 
aspirations (Sewell et al 1957). Improvements in computer technology also 
made it possible for social psychologists to begin large-scale longitudinal and 
panel surveys. One was the Wisconsin study of social and psychological 
factors in the educational and occupational aspirations and attainments of over 
10,000 students who graduated from high school in 1957. (See Sewell & 
Hauser 1975 for a summary of the early work on this project). Finally, by the 
time this period came to a close many new computer programs enabled social 
psychologists to use quite advanced mathematical statistical models in their 
research. Of course, social psychologists were not the major contributors to 
these mathematical and statistical techniques or to computer technology but, 
like other scholars, they were quick to adopt such techniques once they 
became available. 

In closing I must say that although there were important improvements in 
the research methods used during this period, their main effect was to increase 
the reliability of our observations rather than to extend our powers of observa- 
tion. New computers and computer programs did help us to sort out some of 
the complexities of social psychological behavior that would been almost 
impossible with earlier techniques. However, none of this was sufficient to 
bring about major theoretical breakthroughs to fuel great advances in social 
psychology. 


Summary and Conclusions 


Unfortunately, the rather modest developments that took place in social 
psychological theory and methods during its Golden Age were not sufficient 
to serve as the basis for a new interdisciplinary field. This was particularly 
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true because of social psychology’s weak position in the university structure 
and the inadequate funding available from university and federal sources. 
Contrast this with the success of the interdisciplinary programs in the natural 
sciences—particularly with those in molecular biology, where tremendous 
theoretical breakthroughs, stemming from the work of Watson, Crick, and 
Wilkins on the structure of DNA, provided the stimulus for a whole new 
approach to biological studies.'' This, plus the perfection of powerful new 
instruments for observation and measurement (such as the election micro- 
scope and several complex devices and techniques for studying large and 
small molecules) spawned complex research problems that could only be 
solved by bringing together the skills and knowledge of physicists, chemists, 
geneticists, bacteriologists, zoologists, and botanists. Usually it was the 
younger scientists in these fields who wer= willing to engage in this joint 
effort and to learn the new techniques necessary for success in solving new 
problems. In the early years of the new programs, most of the scientists 
involved maintained their departmental conaections but did their research in 
molecular biology teams. The level of cooperation of the parent departments 
with the interdisciplinary programs was not uniformly high in these years, but 
there was no great departmental resistance because adequate funding was 
available to permit other scholars in the departments to continue their es- 
tablished research programs. At the same time there was plenty of new money 
` for the support of molecular biology. In fact NIH and NSF were so anxious to 
promote interdisciplinary programs in molecular biology that they were will- 
ing to provide funds for new buildings, laboratories, and equipment as well as 
salary support for faculty members and ample stipends for pre- and postdoc- 
toral trainees. Over the years support has continued at high levels for both 
training and research in molecular biology. In several instances molecular 
biology has been granted full departmental s-atus and in all instances has had 
the power to set its own graduate requirements and to grant its own PhD 
degrees. 

One cannot help but wonder what would heve happened if generous support 
from the universities and the federal funding agencies had been available to 


'IMy comments on molecular biology are based cn an interview with Robert M. Bock, 
professor of biochemistry and molecular biology and dean of the graduate school, University of 
Wisconsin-Madison. The Wisconsin program in molecular biology (now called Cellular and 
Molecular Biology) began in 1952 with five professors fsom biochemistry, genetics, and physics, 
with support from the Graduate School, NIH, and NSF. Funds for a new ten-story building with 
completely equipped laboratories and offices were prov-ded from federal sources. The program 
now includes 82 professors from 17 departments. All cf its 100 or more graduate students are 
guaranteed three years support from University fellowships, or NIH traineeships or Research 
Assistantships. Teaching costs are now bore by the Unrversity. Research support comes mainly 
from federal sources. The program has never sought cepartmental status and maintains great 
flexibility in its research and teaching functions. 
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interdisciplinary programs in social psychology. Probably, no great new 
theoretical breakthroughs would have occurred or powerful new instruments 
or techniques of research would have been developed. The nature of social 
psychological phenomena makes such developments very difficult. But 
adequate and appropriate funding would have increased the probability of 
important developments in these areas and would most certainly have made 
greater progress in the improvement of social psychological theory and 
methods. I am confident that we would have been turning out more good 
research and more well-trained social psychologists had sociologists been 
collaborating with psychologists and other social scientists in interdisciplinary 
social psychology research and training programs. We might have been 
making much greater contributions to the understanding of some of the 
common social problems of our time.'* This in turn probably would have led 
to greater support for research and training in social psychology. 
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LITIGATION AND: SOCIETY 


Lawrence M. Friedman 
Law School, Stanford University, Stanford, CA 94305-8610 
Abstract 


Litigation, in ordinary speech, refers to actions contested in court; this 
involves a claim, a dispute or conflict, and the use of a specific institution, the 
court, to resolve the conflict or dispute. In the past most legal research has 
consisted of analysis of doctrine and theory about doctrine. But litigation is an 
important phenomenon in its own right and research lately has shown this. 
This chapter aims to sketch out a few major areas of research and theory and 
to add a few brief remarks about the significance of the work thus far. The 
topics covered include: dispute-centered and court-centered research; quantity 
of litigation and the so-called litigation explosion; and the impact of litigation 
on society. 


INTRODUCTION 


Only in the last decade or so has there been substantial research on litigation, 
or theorizing about the social meaning and impact of litigation, even among 
law and society scholars and those who identify themselves as sociologists of 
law. To be sure, legal scholarship in common law countries has always been 
obsessed with appellate litigation; but what jurists considered “research” 
consisted mostly of analysis of doctrine, and theory about doctrine, all quite 
formalistic- and never quantitative or empirical. In other countries, legal 
scholars have had even less interest in conducting systematic research on 
litigation. For their part, social scientists have tended to neglect litigation as; 
well. This was perhaps originally a reaction against the tendency of legal 
research to act as if the law was nothing but formal process. Sociology, then, 
took as its domain informal -processes—law-related behavior that took place 
outside the courtroom setting. 

But litigation is an important phenomenon in its own right, and this has 
become very obvious in the last two generations. The school desegregation 
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cases, and in general the activities of the United States Supreme Court under 
Earl Warren, suggested the potential for social change through litigation. 
More recently, the so-called “litigation explosion” (Galanter 1983) has led to 
speculation about the harmful social effects of litigation, real or imagined. 
The discussion is by no means confined to the United States, although di 
United States is usually seen as the worst offender. 

In one sense, to be sure, the. large literature on courts, judges, juries, 
litigants, and the like, in sociology, criminology, legal history, political 
science, psychology, economics, and anthropology, is all relevant to the 
problem of litigation; but I work here with a somewhat narrower and more 
manageable concept of the field. My aim has been to sketch out a few major 
headings of research and theory, and to acd a few brief remarks about the 
Significance of the work thus far. We therefore ignore many of the cognate 
questions and fields of research—for examele, the truly vast literature on the 
jury, and on the dynamics of jury deliberation and decision-making (for an 
overview, see Hans & Vidmar 1986). 


Litigation Defined 


No definition of litigation commands generel agreement; indeed, most of the 
literature on courts makes no attempt to def_ne litigation at all. Litigation, in 
= ordinary speech, refers to actions contested in court. The core meaning thus 
implies three distinct elements:. first, a clam, that is, an active attempt to 
attain some valued end; second, a dispute cr conflict, in other words, resis- 
tance to the claim; and third, the use of a specific institution, the court, to 
resolve the conflict or dispute. 

This definition may seem banal, but there are subtle choices implicit in the 
formulation. The phrase “action contested in court” implies that the contest 
takes place inside the courtroom. But courts have other functions besides 
litigation. In the life cycle of many disputes. the final act of the drama, or its 
(apparent) resolution, takes place inside the courtroom; and yet the court stage 
is (arguably) not true litigation. 

_ -For example, only a court can grant a divorce; and parties to a divorce are 
often in serious dispute over the division of property rights, support pay- 
ments, or custody of children. But the courtroom phase may-range all the way 
from bitter and protracted battles to perfunctory and routine paper-shuffling. 
Indeed, in the typical case, the problems are worked out, usually with the help 
of lawyers, long before any of the actors appears before the judge. What is 
presented to her honor is a package of agreements already settled; she merely 
rubber-stamps these prior agreements. Unde- our definition, divorces of this 
type, though judicial statistics report them as “cases,” are not litigation, 
because they are not contested in court. Since the case-loads of many trial 
courts. are dominated, quantitatively, by completely uncontested divorces, to 
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consider such cases “litigation” would seriously distort the figures on litiga- 
tion rates. These cases, in the aggregate, may demand time and effort; but 
they do not seriously overburden the system as the more complex cases do, 
which are actually tried. 


Dispute-Centered and Court-Centered Research 


There are at least two distinct approaches to the study of litigation. Some 
scholars are primarily interested in disputes themselves, as a social phe- — 
nomenon—their causes and cures. For them, litigation is only of interest as a 
phase in the life-cycle of disputes. Other scholars are primarily interested in 
courts as institutions. They are thus not directly concerned with conflicts and 
disputes that never reach the courts, except insofar as court decisions have an 
impact on such disputes, for example, by influencing the terms of settlements 
(see Ross 1970). 

Research on disputes suffers from the difficulties of defining and measuring 
disputes. Court statistics are, on the whole, quite poor; statistics on the 
population of disputes are worse, or non-existent. Although it is not easy to 
count defamation cases, for example, it is thoroughly impossible to measure 
the population of insults, or the population of disputes in society that arise out 
of insults. Sometimes there is data on the number of potential disputes—for 
example, data on serious railroad or automobile accidents—which give a kind 
of base-line for measuring possible lawsuits (see Friedman 1987, Munger 
1987). Victim studies in recent years have provided base-line data for measur- 
ing the relationship between criminal acts and the number of arrests, charges, 
and cases (see Hindelang et al 1976). But these are exceptions. 

In recent years, a start has been made in studying the life-cycle of disputes, 
from their beginnings in the social field, to the point where a few of them 
“ripen” into lawsuits. The most ambitious attempt has been the Civil Litiga- 
tion Research Project (CLRP) at the University of Wisconsin. CLRP used the 
dispute as its unit of analysis (Trubek 1980-1981), in order to link the work of 
courts with the social context out of which disputes arise. The researchers did 
not confine themselves to disputes that ended up in court. They also drew a 
random sample of households and organizations to identify disputes that never 
got as far as court (Kritzer 1980-1981). 

Dispute-centered research asks why some situations produce disputes, and 
what happens to these disputes. What are the switching devices (so to speak) 
that shunt some disputes onto a court track, while others disappear or are 
diverted into alternative modes of resolution? Logically prior to the study of 
litigation, then, is study of the “transformation” of “injurious experiences” 
into legal claims. Felstiner et al (1980-1981; see also Fitzgerald & Dickins 
1980-1981) describe a three-stage process of transformation. First, in- 
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dividuals perceive themselves as undergoing an “injurious experience” (they 
call this “naming”); next, the experience becomes a grievance (“blaming”); 
the third stage is “claiming,” that is, the process of turning to the responsible. 
party and asserting a demand for remedial action. At this point, a dispute has 
emerged, which, if not settled or arranged beforehand, may end up as actual 
litigation. 

One theoretical and empirical issue is why such “transformations” occur or 
do no occur. Clusters of factors can be isolated. One cluster is substantive 
(based on “rules of law” or “doctrines”). In Paraguay, the law does not allow 
absolute divorce; this is a substantive barrier, which keeps many kinds of 
marital dispute out of court. Other factors are institutional or procedural: the 
expense of lawsuits, or the steps that make litigation simple or difficult, 
whether courts are accessible or not, formal or informal, and so on. 

These various structural and procedural factors are often quite obvious. It is 
more difficult to demonstrate or assess cultural factors. It is, however, 
commonly assumed that culture (and personality) factors are critical in ex- 
plaining why (for example) the Japanese seem to litigate very little, and 
Americans a great deal. That is, “litigiousness” is posited as a specific cultural 
trait, a matter of custom, tradition, and way of life; the Japanese (it is said) 
prefer compromise and interpersonal arrangements; Americans are in- 
dividualists—battlers and sticklers for rights. Needless to say, there is very 
little hard research on such issues. In fact, it is not clear whether cultural 
factors best explain varying rates of litigation, or whether structural and 
substantive barriers should be invoked (see Upham 1987, Fitzgerald 1983). 
Most probably, litigation rates are the product of multiple factors—including 
the sheer difficulty and expense of litigation. Doctrinal and structural barriers, 
after all, are not generated out of thin air, but are durable or permanent 
patterns formed out of “softer” cultural phenomena. 

In one sense it is misleading to talk about “barriers” to litigation; this 
assumes that it is normal or natural for a dispute to end up in litigation. In fact, 
most people do not pursue their grievances at all. Claims-consciousness is 
related to class—better educated, more articulate people are more apt to insist 
on their rights (Caplovitz 1963, Best & Andreasen 1977). Even when griev- 
ances mature into “disputes,” they do not necessarily become lawsuits. Most 
disputes disappear or are settled long before the trial stage, and this has 
apparently been true for at least a century (see Daniels 1985, Friedman & 
Percival 1976). From one theoretical standpoint, indeed; every trial is a 
mistake in calculation. It is almost always in the interests of the parties to 
settle; trials are socially disruptive, and people in continuing or community 
relationships tend to avoid them (Macaulay 1963, Engel 1984, Ellickson 
1986). Moreover, trials are costly affairs, and in civil disputes; there is 
usually a zone of settlement or range of values at which both parties are better 
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off if they settle (see Ross 1970). Trials result when parties seriously misjudge 
the likely outcome of a trial or insist on litigation in order to establish some 
principle. . 

The dispute that ends up in court has been transformed in another way, too. 
It has been, necessarily, translated from raw, lay norms and descriptions, into 
legal categories; it has been encoded and reworked to fit the traditions and the 
habits of internal legal culture (on this concept, see Friedman 1975:223). In 
the process, the dispute itself has been subtly or not so subtly altered. 
Lawyers, then, who do the translation in this and most modern societies, 
exercise control over disputes and their outcomes by virtue of their command 
of the language and the traditions which the legal system legitimates and to 
which it assigns a privileged place. There has been, unfortunately, very little 
systematic work on this process of translation and transformation (but see 
Mather & Yngvesson 1980-1981). It is clear, however, that the practice can 
be more or less “participatory” or autocratic; and that the style of lawyering 
makes a difference to the outcome of cases (see Rosenthal 1974). 

Whether disputes end up in court also depends on the definition of a court 
(see Shapiro 1981). Institutions called “courts” in this society perform tasks 
other than dispute-settlement. They have administrative responsibilities, for 
example—probating estates, or formalizing name changes. On the other 
hand, many institutions imitate the courts, or use courtlike processes, without 
the name or the official status. 

To begin with, in some societies there are “tribunals” which exist apart 
from the formal court system. In many societies, too, arbitration is a common 
process, substituting for “regular” judicial progress. Arbitration differs from 
“litigation” chiefly in that the arbitrator is only a temporary judge—usually 
selected by the parties—rather than a state official. The spread of “due- 
process” within institutions, government agencies, and other large organiza- 
tions, in addition, has meant that internal dispute-settling or grievance pro- 
cedures exist throughout society, institutionally very much like courts; in 
some instances, the parties may even use lawyers to help them prepare or 
argue their “case” (Macaulay 1987). 

The court system in the United States, and in most modem nations, is 
exceedingly complex. There are civil and criminal courts, sometimes run as 
separate institutions; petty courts, trial courts, and intermediate appellate 
courts (in most states), and state supreme courts (see Kagan et al 1978); there 
is also the three-tier system of federal courts (see, for example, Howard 
1981). Each level can be a separate object of study. There are also specialized 
courts—in Europe, labor courts and administrative courts are quite prominent; 
and there are supra-national courts, in the European Economic Community, 
for example. In each society, courts occupy a specific position in the structure 
of government and have a distinct role in and impact on society. There is also 
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a great deal of interest in “alternative dispute resolution”—:modes of dealing 
with conflict and dispute that avoid the formal (state-run) courts. Arbitration 
has already been mentioned. “ADR” was not only a field of research; in the 
1970s it became something of a social movement—a reaction against the 
formal court system, in the interests of efficiency and greater access to justice, 
especially for the poor (see Abel 1982). 





Court Centered Research: The Quantity of Litigation and the 
So-Called Litigation Explosion 


It is commonly assumed that the United States is a highly litigious society and 
that litigation rates have been rising rapidly in recent periods. This rough 
hypothesis appears in popular literature, in the press, and in the speeches of 
judges and politicians. It is also assumed that the effect of the explosion of 
lawsuits, especially tort lawsuits, is harmful to the economy, if not to the very 
make-up of society (Rabin 1988). Fear of litigation stifles innovation, and 
leads to conservative, “defensive” strategies in business and medicine; munic- 
ipal liability has led to the closing of playgrounds, the cancelling of programs, 
and even to urban bankauptcy. But these effects are difficult to demonstrate 
empirically (see below). 

A number of studies have tried to measure American litigation rates over 
fairly long timespans (often a century or more). These studies have, on the 
whole, failed to document the “litigation explosion.” Thus McIntosh (1981) 
studied courts in St. Louis, Missouri, between 1820 and 1977. Litigation rates 
dropped in the last half of the nineteenth century, then rose and fell and rose in 
the twentieth century, but hardly dramatically. The “litigation rate” in the 
1970s was higher than it had been a century before; but 45% of McIntosh’s 
cases were family law cases, almost all of them uncontested divorces. “Litiga- 
tion” in the sense of actual contests in courts in fact was perhaps lower in 
proportion to population in the 1970s than in the 1850s (see also Friedman & 
Percival 1976, Munger 1988). On the whole, those who have studied litiga- 
tion rates tend to agree that there are no signs in state courts of a quantitative 
explosion (for the literature, Galanter 1983; a dissenting note is Marvell 
1987). Filings in federal court, however, are an exception; there is no 
question that the number of such cases has been increasing far faster than has 
population size (see Clark 1981). But the overwhelming majority of cases 
filed—over 90%—-are filed in state courts. It is the state courts that handle 
almost all cases of family law, personal injury, and criminal justice, and the 
overwhelming bulk of ordinary commercial matters. No increase on the 
federal scale can be documented for state courts. Gifford & Nye (1987), 
examining recent data, found evidence that litigation rates in Florida were 
rising more rapidly than population rates. The Florida data, however, thus far 
seem exceptional. 
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The federal data do point toward a more interesting and promising issue: 
changes in the type of case over time. The increase in federal filings is not 
difficult to understand, in light of the increased role of the central government 
in economy and society, relative to the states. It reflects the dominance of 
federal regulatory and welfare law; it also reflects the activism of the federal 
courts—and, what is often forgotten, the activism of Congress. There are 
thousands of civil rights cases in federal courts; 50 years ago there were 
virtually none. Though some of these cases invoke constitutional rights, or 
post-Civil War legislation, the vast bulk of them arise under the Civil Rights 
statutes passed by Congress in the 1960s. Unfortunately, despite the enor- 
mous literature on civil rights, empirical research on civil rights litigation is 
rare (but see Eisenberg 1982). 

The federal courts, too, are the home of the preponderance of large, 
complex “public law” cases (Chayes 1976), which contrast so strongly with 
traditional private litigation. In these “public law” cases the issues go far 
beyond the parties, and the court frames broad remedies and maintains 
continuing jurisdiction. The major school desegregation cases are examples; 
or the long struggles to reform prison systems through litigation. There have 
been examples of “public law” cases in the United States since the nineteenth 
century; but there is no question that cases such as Chayes describes are more 
frequent today than before. Unfortunately, there is no agreed-on definition to 
mark off the boundaries of this case-type, and thus no data on the precise 
number of such cases. However, their importance is beyond dispute. It is also 
likely that they account for a good deal of the time and effort spent on 
litigation. The legal profession has been growing very rapidly in the United 
States—there were perhaps 60,000 lawyers in 1880, about 104,000 in 1900 
(Friedman 1985:633), and over 650,000 in 1985 (Curran et al 1986:1); the 
number continues to increase steadily. Law firms have been getting very 
much larger as well (Nelson 1988:2); the largest firm today, Baker & Mac- 
Kenzie, has over 1,000 lawyers on its staff; and a greater share of the effort of 
the large firms seems to be devoted to litigation (Galanter & Rogers 1988). 
“Public law” cases, along with a few giant private cases (antitrust, for 
example), account for much and perhaps most of this effort. 

Longitudinal studies of state trial courts have also looked at changes in the 
mix of business that courts handle (see, for example, Friedman & Percival, 
1976). Compared to the nineteenth century, commercial cases and ordinary 
property cases account for a lesser share of the caseload of the state courts. 
Personal injury cases. family law cases, and public law cases have increased 
in number and percentage. Courts have been spending less of their time on 
market-oriented disputes, and more on disputes that have an expressive, 
personal element. The only large-scale quantitative study of state appellate 
litigation came to a similar conclusion. This was a study of sixteen state 


24 FRIEDMAN 


supreme courts between 1870 and 1970. For.1870—-1900, debt, contract, and 
real property cases made up 55% of the case-load. More “personal” issues— 
tort, criminal law, and family issues—added up to less than 30%. For the 
period 1940-1970, these proportions were reversed; debt, contract, and prop- 
erty had fallen to 25.9%, but tort, crime, and family issues now aggregated 
52.3%. However, there has been something of a resurgence of contract and 
business litigation in recent years, at least in federal court (see Galanter & 
Rogers 1988, Nelson 1988). i 

How can one make sense of these various developments? The longitudinal 
data, to begin with, render extremely doubtful the assumption that there is a 
simple, linear relationship between litigation and economic development. In a 
path-breaking study, Toharia (1974) analyzed the work of the Spanish courts 
between 1900 and 1970. He found that there was, in fact, an inverse relation- 
ship between economic development and the volume of litigation. Litigation 
in the formal courts actually declined in the period he studied; the decline was 
most pronounced in the most economically advanced sectors of the country. 
Studies in other European countries generally confirmed Toharia’s findings, 
for the period in question (Blegvad et al 1973, Rottleuthner 1985). 

It is possible to connect these results with empirical and theoretical dis- 
cussions of why individuals and businesses do or do not litigate. Individuals 
and businesses with ongoing relationships avoid litigation, which is inherently 
disruptive. This is the theme of the classic article by Macaulay (1963), which 
studied the behavior of Wisconsin businessmen. Thus the recent upsurge of 
contract cases, as noted by Galanter & Rogers (1988; see Friedman 1989), 
suggests changes in the nature of business relations themselves—more volatil- 
ity, rougher competition, and less emphasis on continuing, quasi-personal 
relationships among principals of firms. Study of this new phenomenon seems 
promising, since it supplements the research on disputes by exploring models 
and devising theories to explain why firms and individuals do or do not go to 
court. 

Research on the outcomes of cases, oddly enough—as opposed to the 
quantity of litigation, or the subject matter, or the motivation—is com- 
paratively thin, although most of the longitudinal studies present at least some 
data bearing on outcomes. One starting point for research is the well-known 
article by Marc Galanter, suggesting reasons why the “ ‘haves’ come out 
ahead” in trial court litigation (Galanter 1974). To a certain extent, the 
reference to “haves” is misleading. The argument is that “repeat players”— 
those who use the courts constantly—tend to win out over “one-shot” liti- 
gants. “Repeat players” are usually, though not always, the richer and more 
powerful’ parties—the government, or big business; accident victims are 
typical “one-shotters.” 

.Galanter’s thesis is essentially structural. It is the organizational strength of 
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“repeat players” that accounts for their success. Ideology or simple class bias 
are not put forward as critical variables. Wheeler and associates have recently 
presented findings on winners and losers in state appellate courts, 1870-1970 
(1987). There the results are also favorable to the “haves,” that is, large 
organizations and government entities, though not overwhelmingly by any 
means. Recently, too, the Rand Corporation has run important studies of jury 
verdicts, mostly in personal injury: cases. Their data suggest the need to 
reexamine the usual assumption that plaintiffs consistently win at trials (see, 
for example, Shanley & Peterson 1983; for a historical study that points in the 
same direction, see Friedman 1987). 

The Toharia study raised questions about the much discussed issue of the 
“autonomy” of the legal system, specifically, the autonomy of courts and 
litigation rates. Toharia did not suggest that the legal system was, as a whole, 
“autonomous” in the sense of unconnected and independent of social forces in 
society; but his findings could be interpreted to mean that at least the formal 
courts of Spain were increasingly irrelevant to economic life. Presumably, 
these courts were locked into a tough and relatively rigid tradition, which 
rendered them incapable of adjusting rapidly to social change and the de- 
mands of modern business litigants. As a consequence, these courts were 
increasingly by-passed by economic institutions and perhaps by ordinary 
citizens as well. These were important findings, and very fruitful in stimulat- 
ing other research, much of which tended to confirm Toharia’s findings. Of 
course, it is by no means clear that the situation described can be generalized 
across cultures, or to other times, places, and institutions. Certainly, the work 
of American courts in at least some areas—civil rights and civil liberties, 
product liability, and medical malpractice—hardly suggests rigidity; there 
may be parallel changes taking place in Europe as well. Certainly, the 
explosive growth of judicial review, in West Germany for example, suggests 
a more “American” model of court use (see Bryde 1982). And when Toharia 
returned to his subject, a decade or so after his first study had been published, 
he found that the workload of the Spanish courts had risen sharply since the 
early 1970s (Toharia 1987). 


The Impact of Litigation on Society 


This is largely uncharted territory, as far as systematic study is concerned. 
One of the “impacts” of litigation, especially in a common law system, is the 
creation of the basic legal norms themselves. This “impact” is so fundamental 
that it is often taken for granted; and in a sense most of the legal literature, and 
a good deal of the social science literature on particular fields of law, concerns 
this impact. Much less common are theoretical or empirical attempts to speak 
generally about the mode in which courts frame and devise rules, and the 
nature of the rules which they are likely to frame (but see Friedman 1967). A 
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few legal scholars, in the 1970s, advanced the thesis that litigation in common 
law courts tends to evolve in the direction of “efficient” rules (see Priest 
1977); this thesis is, however, not generally accepted. 

There are also studies of how litigation in particular courts affects the 
community in which the court sits; this is a theme, for example, of much of 
the literature in legal anthropology. Courts in many societies serve the 
function of restoring harmony and balance; they are concerned less with “law” 
and “rights” than with repairing ruptures in ongoing relationships (see Nader 
1969). This is not necessarily the outcome in Western societies. Merry (1979) 
studied the efficacy of a lower court in an American urban neighborhood, for 
example, and concluded that the court was not an effective institution of 
dispute-settlement. Rather, the court was used in this community “as a 
sanction, a way of harassing an enemy,” as a “weapon marshalled by dis- 
putants to enhance their power and influence,” rather than “as a mode of 
airing and resolving disputes” (Merry 1979:919). Thus, the court does not 
repair broken ties; it may even exacerbate a tense situation. Tactical use of 
law to hurt an enemy may be particularly characteristic of colonial situations, 
where the law is an alien intrusion (Cohn 1965). 

There is a considerable literature, too, made up of so-called “impact” 
studies (Wasby 1970, Johnson & Canon 1984)—-studies of the consequences 
of particular court decisions, for example, the Supreme Court decisions 
banning prayers from public schools (Muir, 1967) or on abortion (Hansen, 
1980), or state decisions extending tort liability (see Givelber et al 1984). 
These studies focus on whether officials obey such decisions, and why, or 
whether the decisions have an influence on behavior among various affected 
publics. Such studies raise conceptual and methodological difficulties (see 
Rabin 1979); and, in any event, data about the aggregate or cumulative 
impact of courts and litigation is not easy to come by. Litigation has both 
direct and indirect effects; and the indirect effects can be subtle and difficult to 
measure. Fears of losing or hopes of winning influence the course of out-of- 
court-bargaining, as we have noted. Judicial actions have what Galanter has 
called “radiating effects” (Galanter 1987:215)—they ripple outward into the 
larger society. 

There is, to be sure, a popular literature of invective about the evil results of 
the “litigation explosion.” Litigation itself is undoubtedly expensive. It is — 
possible to gather rough figures to show how much businesses and individuals 
spend on litigation each year. It is not clear how one would go about 
measuring the more remote and consequential costs. How is one to know 
whether litigation is, or is not, stifling “innovation?” It is commonly stated 
that medical malpractice suits have led to an increase in wasteful, “defensive” 
medicine. No doubt, there have been some effects, but the studies do not 
agree on how much (Zuckerman et al 1986; 107-10; Tancredi & Barondess 
1978). Thus no one can be sure whether (say) excessive tests and lab work 
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offset the possible gains from more cautious doctoring. With regard to other 
alleged costly consequences of litigation, there are serious problems of cause 
and effect. Even assuming that one can make concrete so vague a notion as 
“decline of trust,” it is possible that “decline of trust” is the cause of certain 
forms of litigation, rather than the effect (see, in general, Friedman 1985). 

Moreover, it is striking that popular opinion and scholarly critiques alike 
put so much emphasis on hidden costs and side-effects, and show so little 
concern for hidden benefits. The benefits, to be sure, are often quite intang- 
ible and immeasurable: social justice; expanded opportunities for women and 
minorities, expansion of civil liberties, fair procedures within institutions, 
limits on government (see Galanter 1986:28—37). Who would deny that these 
are significant gains? Whether they are worth the costs is a question that 
models and equations cannot answer. 

The determinants of litigation are complex. Perhaps the key fact is that 
“litigation” is not a unitary phenomenon, and thus it would be vain to try to 
relate litigation as a process to any general sociological theory; or even to 
general theory in the sociology of law. There is no reason why the same 
theoretical apparatus would explain quantitative and qualitative aspects of the 
various types of litigation: “ordinary” litigation—-boundary disputes between 
landowners, squabbles over custody, breaches of sales contracts—as well as 
giant public-law cases, mass tort cases (for example, the literally thousands of 
asbestos suits, or the “Agent Orange” cases; see Schuck 1986), civil rights 
test cases, huge private anti-trust cases; not to mention, at the other extreme, 
routine eviction and repossession cases, and neighborhood disputes, in small 
claims or neighborhood courts. It is by no means clear that all of the giant 
“modern” lawsuits constitute a single phenomenon in themselves. | 

Judicial statistics are poorly kept, and the compilation of data sets for state 
(Kagan et al 1977) and federal cases (Carp & Rowland 1983) has only 
recently begun. But perhaps what is needed most of all is intensive, small- 
scale study of lawsuits in different courts, at different stages of the life-cycle 
of disputes. Out of this might emerge a typology of disputes and litigious 
occasions; and at that point meaningful hypotheses might be framed about the 
many forms and shapes of litigation. 
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Abstract 


After World War II a rapid development of sociology occurred in the Scan- 
dinavian countries, Denmark, Finland, Norway, and Sweden. By combining 
national intellectual traditions with different specialities in American sociolo- 
gy some clearly distinctive features developed within the sociology in the four 
countries. The student revolt at the end of the 1960s and its aftermath had 
substantial effects on the development of Scandinavian sociology. The depen- 
dence on American sociology weakened considerably although it did not 
totally disappear. Another effect was that the differences among the sociolo- 
gies of the Scandinavian countries began to disappear. Rather than national 
sociologies there are now various orientations and code systems more or less 
shared by different groups in all Scandinavian countries. Sociology has 
become much more pluralistic, but in the process there is also an increased 
fragmentation and a loss of unity among the sociologists. 

In the Scandinavian countries there are studies of good quality in many 
fields of sociology. By using international reputation, and the impact on other 
domains of sociology as the two main criteria, four fields are singled out as 
particularly interesting. These fields are studies of welfare and the welfare 
state, comparative studies of social stratification and political sociology, 
women’s studies, and cultural sociology. 


RECENT DEVELOPMENTS IN SCANDINAVIAN 
SOCIOLOGY 


During the last decades, sociology in the Scandinavian countries-——-Denmark, 
Finland, Norway, and Sweden-—has grown considerably in size and become 
diversified in content. The national sociologies cannot any more be described 
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in terms of some simple formulae. During the period immediately following 
World War II Scandinavian sociology was strongly moulded by American 
sociology. Yet, by combining national intellectual traditions with different 
specialities in American sociology, sociology in these four countries de- 
veloped some distinctive features. References here to early developments are 
confined to the four countries mentioned above. Sociology in the fifth society, 
namely Iceland, was not established as a university subject until 1970 (Thor- 
lindsson 1982). 


Previous Developments 


In a paper first published in 1967, the author (Allardt 1967) described 
sociology in four Scandinavian societies with some simple labels. Danish 
sociology was said to focus on the concrete and practical, Finnish sociology 
on macrosociology and the quantitative, Norwegian sociology on the latent 
and the qualitative, and Swedish sociology on the manifest and the methodi- 
cal. 

Labels such as these can of course be critized as too general and therefore 
misleading. Yet, even in retrospect, they seem to have captured something 
essential of the sociology in each of the four countries, up to the second half 
of the 1960s. Now in hindsight it is easy to see that some new tendencies, 
which later grew stronger, were already emerging in the early 1960s. Both the 
general tendencies noted above and recent hindsight merit some brief com- 
ments to allow understanding of the subsequent developments. 

The characterization of Danish sociology as concrete and practical was 
based on the research orientations of the two main centers for sociology 
research in Denmark during the 1950s and early 1960s. These were the 
Danish National Institute of Social Research, under the direction of Henning 
Friis (1960), and the Institute of Sociology at the University of Copenhagen, 
with Kaare Svalastoga as professor and director. Whereas the-former institute 
focussed its research on concrete social problems, the latter became known 
for its large-scale empirical studies on social class and mobility (Svalastoga 
1959). Also at this time, new work emphasizing culture and subjective 
societal elements began to emerge. In 1966 a new Institute for Cultural 
Sociology was founded at the University of Copenhagen. The interest in 
culture and cultural sociology has remained very much alive among Danish 
sociologists throughout the 1970s and 1980s (Due & Madsen 1983). 

In Finland in the late 1950s and early 1960s, a special interest developed in 
studying the structural bases for societal conflicts and cleavages (Allardt & 
Littunen 1964). It arose both out of structurally based political conflicts in the 
Finnish society and out of an academic tradition of studying the rise of the 
working class (Waris 1932). Due to the influence of the ethnosociologist 
Edvard Westermarck (1906-1908), and in contrast to the other Scandinavian 
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countries, Finland already had chairs in sociology at several universities prior 
to World War II. However, there was a reorientation in the sociology taught 
and studied in the late 1940s. Finnish sociological studies became strongly 
quantitative, and sociologists developed a special interest and skill in multi- 
variate techniques. In particular, factor analysis was used and misused in a 
great number of studies. Much of the work on the development of multivariate 
statistical methods in sociology was conducted at the Finnish Research In- 
stitute for Alcohol Studies. Its substantial competence in the study of drinking 
and alcoholism outlived its strong emphasis on factor analysis, and the 
institute became a leading international institution in its field (Bruun & al 
1975). 

When sociology developed in the Scandinavian countries after World War 
II, Norwegian sociology got an especially good start. Both inside and outside 
Scandinavia it was thought the most interesting and original of Scandinavian 
sociologies. The foremost Norwegian speciality was the unveiling of hidden 
and latent role expectations, role strains, latent solidarities, and group ties. 
Norwegian sociology was characterized as focussing on the latent and the 
qualitative, a tendency made manifest in the title The Hidden Society, one of 
Vilhelm Aubert’s (1965) important books. Another good example is Sverre 
Lysgaard’s study of what he called the worker’s collectivity. He showed how 
workers in a factory are tied to each other by hidden bonds, often not clearly 
perceived. The workers form a collectivity that is clearly different from 
organized and formally acknowledged groups such as the local trade unions or 
from the formal hierarchy of the industrial shop (Lysgaard 1961). Focus on 
latent effects also characterized the best Norwegian studies in criminology 
such as Nils Christie’s study (1960) of punitive treatments of alcoholics. 

Although the foremost Norwegian speciality during the two first decades 
after World War II was qualitative analyses of latent social forms, some 
advances in quantitative methodology can be observed in work done during 
the 1960s (Galtung 1970). Some special forms of quantitative methodology 
were developed to facilitate research on center-periphery relations, which 
have fascinated analysts of the Norwegian society. Internationally well- 
known theories about centers and peripheries were developed both within the 
field of peace research (Galtung 1978) and in political sociology (Rokkan 
1970). During the subsequent periods both the substantial focus on center- 
periphery relations and on quantitative methodology grew in importance 
within Norwegian sociology. Natalie Rogoff Ramsøy (1965), with her back- 
ground at the Bureau of Applied Social Research (Columbia University) and 
her experience in studying social mobility, began to influence Norwegian 
empirical research. 

That Swedish sociology was centered on the manifest and methodological 
was based on the idea that Swedish sociology had quickly reached a high level 
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of sophistication in theory and methodology, whereas it was fairly tame and 
noncommittal in its substantive descriptions of the Swedish society. Torgny 
Segerstedt (1955) at Uppsala University defined sociology as the study of 
social norms and uniform behavior, and this inspired many Swedish sociolo- 
gists of the first generation to study the importance of language and verbal 
persuasion in social life. One of Segerstedt’s early students, Hans L. Zetter- 
berg, made a successful career within American sociology (Zetterberg 1954). 
Among the first Swedish sociology professors, Gösta Carlsson (1958) con- 
ducted an excellent study of social mobility, but its main contribution was 
methodological rather than factual information about Swedish society. 

During the 1960s new tendencies clearly emerged in Swedish sociology. 
Ulf Himmelstrand (1960) gave the theory of social norms a new direction by 
focussing on political conflicts and by studying situations in which social 
norms fail to function. Most explicitly Joachim Israel emphasized a conflict- 
theoretical perspective by analyzing the concept and the literature on aliena- 
tion (Israel 1968), combining both Marxist and non-Marxist approaches. 
Edmund Dahlström was one of the first to study the life situation of women 
(Dahiström et al 1962). On the whole, toward the end of the 1960s Swedish 
sociology was moving into a new phase. 

Established at the University of Iceland in 1970, Icelandic sociology was 
shaped mainly by British ideas. The Scandinavian influences were minor 
(Thorlindsson 1982), and there were hardly any domestic antecedents. As an 
Icelandic political scientist (Grimsson 1977) has concluded: “A department of 
social science was established in the autumn of 1970, enabling Icelandic 
students to obtain a B.A. degree in political science and sociology. These 
disciplines were, however, without any roots in the Icelandic academic or 
social community: they did not even have acknowledged names in the 
Icelandic language.” 


The Student Revolt in 1968 


The student revolt at the end of the 1960s (often dated to 1968 due the events 
in Paris in May of that year) was not as violent in the Scandinavian countries 
as in many continental countries, or in the United States or Japan. The student 
revolt did, however, occur in the Scandinavian countries, and it had a 
profound effect on the development of sociology. The revolt introduced a new 
interest in Marxism and in conflict-oriented perspectives generally. The 
uprising occurred during the Vietnam war and was also partly inspired by the 
protests against the war. It was clearly related to anti-American feelings. Even 
if these feelings were hardly of a very lasting importance, American sociology 
and especially some branches of American functionalism came under hard 
attacks. 
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The effects of the student revolt and its aftermath on Scandinavian sociolo- 
gy were hard to pinpoint because they were both penetrating and small. The 
effects were small in terms of substance: most types of now ongoing research 
have their roots in work in progress when the student revolt came. Yet, the 
changes are profound as regards some general and international orientations. 
These changes may be summarized in three main points: 

1. The dependence on American sociology began to weaken. It did not 
wholly disappear, but the earlier tendency to take American sociology as a 
given model more or less disappeared. Citation studies have also shown that 
the number of references to American sources decreased. A more problematic 
feature is that, according to a Finnish study (Rautio & Suhonen 1981), the 
interest in sociology beyond the national borders seems to have weakened. 
The citations of domestic sources are the ones that have increased. There are 
good grounds to assume that the tendency is much the same in the other 
Scandinavian countries. | 

2. At the same time, sociology in all Scandinavian countries has become 
much more pluralistic in its emphases and orientations. Marxist approaches, 
traditional American sociology, and influences from new schools such as the 
French semiotics, offshoots from the critical Frankfurt school, and rational 
choice models exist side by side. 

3. The differences among the sociologies of the different Scandinavian 
countries have become very difficult to trace. Rather than national sociolo- 
gies, there are now different orientations and code systems more or less 
shared by different groups in all the four countries. At any rate, the events 
related to 1968 definitely erased the traditional differences between the 
sociology of the different countries. 

Therefore it seems best to proceed here by arranging the contents in terms 
of general problems and research orientations existing in all four countries, 
rather than in terms of national differences. It also means that we will refrain 
from attempting short-hand characterizations of the typical Danish, Finnish, 
Icelandic, Norwegian, and Swedish sociology. 

Of the three points mentioned above, the second one emphasizing pluralism 
particularly needs more elucidation. It was said that the student revolt brought 
Marxism back into sociology. This was, as we know, an international phe- 
nomenon, but it meant slightly different things in different countries. In the 
Scandinavian countries it meant that many sociologists indicated some inter- 
est in Marxist ideas and thoughts but that only a very few confined themselves 
to Marxism. In most cases it really meant a pluralistic orientation. A very 
illuminating case is the book on alienation by Joachim Israel (1968). It was 
based on a scrutiny of the concept of alienation in both traditional Marxist and 
modern sociological literature. This book was the first of its kind and in- 
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dicated a path emphasizing the simultaneous use of Marxist and other sources. 
In terms of ideas, it displayed a kind of kinship with Herbert Marcuse and the 
Frankfurt school. 

The best recent theoreticians in Scandinavian sociology are not one-sided 
Marxist or protagonists of Marxism, rather, they have been able to use 
Marxism and to combine it with other approaches. Typical are the two 
important Norwegian theoreticians, Jon Elster and Dag @sterberg. Jon Elster 
has combined Marxist approaches, analytical philosophy, and modern modal 
logic to establish himself as a leading international methodologist of the social 
sciences. He has thrown new light on many of the core problems in sociolo- 
gical theory such as on the nature of functionalism, the nature of unintended 
consequences, the contradictions of society, and the relationships between 
causality and intentionality (Elster 1978). Dag @sterberg is mainly known for 
his excellent Norwegian textbooks in which he eludicates sociological tradi- 
tions and concepts with great skill and understanding of the ideas behind them 
(Osterberg 1978). 

A deliberate attempt to emphasize the importance of combining different 
approaches was made by Ulf Himmelstrand at Uppsala University in his 
Paradigm project. The papers of the project were published in three volumes 
(Himmelstrand 1986, 1987) and included an almost equal share of in- 
ternational and Swedish contributions. 

There have also been systematic attempts to carry out research in terms of a 
consistent Marxist approach. During the first half of the 1970s, Althusser and 
his disciple Poulanzas had a considerable impact on some Swedish sociolo- 
gists, in particular. Some of the best studies with this orientation were 
conducted by Göran Therborn, then at Lund, and now in Gothenburg. His 
doctoral thesis “Science, Class, and Society: On the Formation of Sociology 
and Historical Materialism” (Therborn 1974) is, as its subtitle announces, a 
Marxist study in the sociology of sociology. Yet, Therborn by no means 
confined himself to a Marxist approach. Among other Marxist studies, Jukka 
Gronow’s (1986) doctoral thesis at the University of Helsinki, on the relation- 
ship between Kautsky’s and Marx’s ideas, deserves a special mention. The 
most persistent Marxist group in Scandinavian sociology has probably been 
the school of Capital Logic at the University of Copenhagen (see Due & 
Madsen 1983). 

The references to Marxism here should not be taken as an indication that 
Marxist orientations are particularly strong or that Marxist research is well 
developed within Scandinavian sociology. It cannot, however, be passed over 
because at the beginning of the 1970s many Scandinavian students came to 
believe that Marxism was almost the only worthwhile approach in sociology. 
As a rule, the Swedish and Norwegian Marxist sociologists were mainly 
inspired by Frenchmen such as Sartre and Althusser, whereas their colleagues 
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in Denmark and Finland were influenced by German Marxists. The Frankfurt 
school and particularly Jürgen Habermas had an influence in all the Scandina- 
vian countries. In all these countries, Soviet Marxism was of minor im- 
portance. . l 

Only a few researchers and scholars have developed in-depth knowledge of 
Marxist thinking. Generally, however, Marxism is conceived as part of 
sociology’s heritage. Most students know as much, or actually as little, about 
Marx as they know about Max Weber and Emile Durkheim. All advanced 
Scandinavian students in sociology are aware of the fact that all three con- 
stitute an important part of the heritage of sociology. 


Loss of Unity and a Variety of Code Systems 


The increased pluralism mentioned in the preceding section and the tolerance 
shown in facing very different approaches and paradigms also have their 
liabilities. Within sociology in all Scandinavian countries, a loss of communi- 
ty can be observed. These sociologists used to form an academic community 
with shared knowledge, common aims, and a basic agreement about what 
constitutes good sociology. Today there is a tendency to form factions that use 
a language of their own and that tend to develop their own particularistic 
code-systems. Sociologists of different persuasions and using different para- 
digms only rarely involve themselves in joint debates and discussions. This 
fragmentation is partly due to the greatly increased number of sociologists. In 
the 1950s and the beginning of the 1960s, the community of sociologists in 
each of the Scandinavian countries was not only small, it was almost visible at 
a glance. It was possible within the national community of sociologists to 
know all good researchers and scholars independent of their specialty. The 
loss of visibility and community, however, is not fully explained by the 
increase in the number of sociologists. Present-day intellectual life tends to 
become fragmented. into a variety of different code system. At least in 
Europe, sociology is not only a science; it also represents an intellectual 
activity sensitively reflecting the general mood in intellectual and cultural 
circles. 

There are, to be true, institutional arrangements aimed at emphasizing 
shared aims and common interests among the sociologists. In Finland, Nor- 
way, and Sweden, national sociological journals are published in the domestic 
languages. In Finland we have Sosiologia, founded in 1964, and in Sweden 
Sociologisk Forskning, also founded in 1964. In Norway there are two 
journals of importance. Sosiologi i dag, originally a student initiative, started 
in 1970; but of even greater importance for Norwegian, and one might say 
Nordic, sociology has been the Norwegian. general social science journal, 
Tidskrift for Samfunnsforskning, first published in 1960. All these journals are 
of considerable importance for the sociologists in their own countries. In 
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Denmark, among all the Scandinavian countries, the fragmentation among the 
sociologists has gone farthest and the student revolt hit hardest. Thus, there is 
no joint general Danish sociological journal. In Denmark as in Finland, 
Norway, and Sweden, there is a national sociological association. In the fifth 
Scandinavian or Nordic country, Iceland, a general social science association 
also represents sociology. 

The national sociological associations and the Social Science Association 
of Iceland have joined together in the Scandinavian Sociological Association. 
It organizes Scandinavian congresses in sociology every second year and 
publishes Acta Sociologica, sponsored by the Joint committee of the Nordic 
Social Science Research Councils. 

Despite Nordic cooperation and the publication of Acta Sociologica, it 
seems fair to say that a loss of unity can be observed in Scandinavian 
cooperation. Only a fraction of the Scandinavian sociologists participate in 
the Scandinavian congresses. Furthermore, many sociologists prefer to pub- 
lish their most important findings and messages in national journals in their 
domestic languages, rather than in Acta Sociologica. 

In a recent evaluation of Swedish sociology, evaluators representing Fin- 
land, Norway, and Denmark/the United States (Allardt, Lysgaard, & Søren- 
sen 1988) maintained that among the younger Swedish sociologists there is a 
tendency to self-sufficiency. They prefer to publish in their own domestic 
journals rather than in Acta Sociologica, and they tend to restrict their foreign 
travel to short periods. This characterization certainly applies also to the other 
Scandinavian countries. 

In a sense the diminishing importance of the sociological community 
extends itself into the world of ideas. An increasing number of sociologists 
publish their results and viewpoints for other sociologists of their own kind 
and persuasion. They confine themselves to persons using the same code- 
system as their own. At any rate they tend increasingly to prefer not only 
domestic, but also in other respects particularistic, publication outlets. This 
tendency is probably not confined to the Scandinavian countries only. It is a 
fairly general pattern among sociologists in advanced industrial societies, 
probably at least partly explained by the fact that sociology has become 
clearly established and institutionalized, at least in Finland, Norway, and 
Sweden. In Denmark the fate and the organization of academic sociology is 
pending. The institutes of sociology at the. University of Copenhagen have for 
some time been without sociologists of professorial rank. Not only sociolo- 
gists but also the Danish Ministry of Education, the university administration, 
and the trade unions representing university teachers are today involved in 
debates about the proper organization of sociological teaching and research in 
Denmark. At any rate, developments in Finland, Norway, and Sweden reveal 
how maturation and institutionalization can be problematic. 
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In a short paper describing general tendencies it is impossible to mention all 
the important studies, books, and sociologists. Any selection of names tends 
to become subjective. Yet, it is important to try to specify the criteria for 
inclusion in this paper. Two criteria are of particular importance. First, our 
reporting is confined to sociologists and studies that have had or are likely to 
have some international impact. Second, the studies and the sociologists 
mentioned are supposed to have some impact outside their own domain or 
field of competence. The assumption is that good sociology has a general 
impact, and that its ideas and concepts also can be applied to social phe- 
nomena outside the field originally studied. A great number of competently 
performed and methodologically sound sociological studies are of interest 
only in a narrow field. Some sociologists concentrate entirely on studies of 
this type and associate only with sociologists of their own specialty. The 
members of such groups often quote each other intensively, but their good 
positions on citation index lists do not alter the fact that their messages are 
hardly of a general interest. 

With these two criteria in mind, the following presentation of some singular 
studies and sociologists is restricted to four particular fields: 


1. Studies of well-being, quality of life, and the welfare society. 

2. Comparative research in the fields of social stratification and political 
sociology.. 

3. Women’s studies. 

4. Cultural sociology. 


There are also studies of high quality in such areas as the sociology of 
work, studies of mass communication, and family research. Of these the 
Scandinavian sociology of work in particular has been of an international 
importance in its own field. The Work Research Institute in Oslo and The 
Swedish Center for the Study of Working Life in Stockholm have developed 
into important international centers for the study of work and working life. At 
the Norwegian Institute and now also affilitiated with the Swedish Institute, 
Bjgrn Gustavsen has initiated a relevant and perceptive discussion of the role 
of social science in initiating reforms (Gustavsen 1980). Yet, the importance 
of the Scandinavian sociology of work is mainly confined to its own field. 
The studies of the two institutes mentioned above have influenced sociologi- 
cal research in other branches of sociology only to a minor degree. 


The Scandinavian Approach to Welfare Research 


As a political model the Scandinavian welfare state was already being dis- 
cussed in the 1930s. In comparison with the economists sociologists were late 
in focussing their interest on the special forms and attributes of the Scandina- 
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vian welfare state. This lag was due to a strong dependence on American 
sources in the formative years of Scandinavian sociology. At any rate, toward 
the end of 1960s and the beginning of the 1970s, an emerging and rapidly 
enlarging interest developed in welfare research. This new interest was related 
to the social indicator movement which grew strong in the international social 
science community toward the end of the 1960s. The Scandinavian interest in 
social indicators was definitely focussed on the question of how people live in 
a welfare state. A strong tradition arose of conducting surveys of how well 
individuals actually fare in the Scandinavian states. This research tradition has 
recently been described well in The Scandinavian Model: Welfare States and 
Welfare Research, edited by Robert Erikson, Erik Jørgen Hansen, Stein 
Ringen, and Hannu Uusitalo (1987). Typical of Scandinavian surveys of 
citizen well-being were: their attempt to capture comprehensively most 
aspects of people’s level of living, the reliance on objective rather than on 
subjective attitudinal indicators, and an interest in the problem of inequality. 

Surveys of well-being have been continually refined, but during the 1980s 
the main focus of research was on defining the welfare state and its in- 
stitutional arrangements rather than measuring individual well-being. A good 
example is a report From Poor Relief Toward Institutional Welfare States: 
The Development of Scandinavian Social Policy (1984), written by Gösta 
Esping-Andersen and Walter Korpi. The study is a preliminary report from a 
larger research project covering 18 OECD countries. Its main message is that 
there are very divergent solutions to social policy in modern industrialized 
states, and the Scandinavian model aims at comprehensivess, in contrast to 
residual social policy models which aim to help only those in real need. 

At present several comparative projects of social policy and the welfare 
state are being conducted. Of special weight is an international project, 
Growth to Limits. The Western European Welfare States Since World War II 
(1986), led and edited by Peter Flora of the University of Mannheim in West 
Germany. The first volume contains book-length and comprehensive chapters 
on Denmark, Finland, Norway, and Sweden, written by Lars Nørby Johan- 
sen, Matti Alestalo, Hannu Uusitalo, Stein Kuhnle, and Sven Olson. 

A distinguishing feature of the Scandinavian studies of well-being and the 
welfare state is that they clearly relate social policies to the over-all social 
structure and to the political system. Conventional research on social policy 
was often related solely to a description of the policy measures, but the new 
type of welfare research is intended to present comprehensive societal de- 
scriptions. Closely related to the studies of the welfare state was the Norwe- 
gian power study, commissioned by the Norwegian government in 1972. Ona 
descriptive level it gives an excellent account of a society based on negotia- 
tions between different actors representing a mixture of governmental, trade 
unionist, political, entrepeneurial and generally private interests (Hernes 
1978). On the theoretical level the Norwegian power study contains a thor- 
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ough and in many respects innovative analysis of the exchange concept, using 
ideas not only from sociological theory but also from economics and political 
science (Hernes 1975). In this latter sense the Norwegian power study also 
reflects a general methodological tendency. 

Within Scandinavian social science there is an increasing conviction that 
good and fertile studies about social and societal phenomena are in- 
terdisciplinary, in the sense that they apply ideas from all the social sciences. 
There. is a discussion of whether graduate training in the social sciences ought 
to be clearly interdisciplinary. At any rate, the Norwegian power study has a 
strong Scandinavian impact. It influenced the Swedish government to com- 
mission a Swedish power study in 1985, and it stimulated the researchers of 
the Danish Welfare Survey to make a special and interesting report on power 
relations in Danish society (Valentin 1980). 


Comparative Studies of Social Stratification and Political 
Sociology 


The Scandinavian welfare studies have had a-strong comparative bent. Not 
only are many of them based on empirical comparisons between countries, 
but also the studies focussing on a Scandinavian society. have clearly empha- 
sized a comparative perspective. The same is true for recent Scandinavian 
studies of social stratification. _ 

Several comparative studies of class and social mobility involving some or 
all of the Scandinavian countries are presently being conducted. An in- 
ternational research project with 10 participant countries, led by Erik Olin 
Wright and based on his framework for class analysis indicated interesting 
differences in the class structure of the United States, and the Scandinavian 
countries (Ahrne & Wright 1983). The study has also stimulated com- 
prehensive national studies, such as the one about Finland, and theoretical 
discussions based on their results (Blom 1985). 

The most interesting theoretical development within the field of social 
stratification is the increasing tendency to combine a structural and an in- 
stitutional approach. There is a new interest not only in describing the class 
structure but also in analyzing how politics and institutional arrangements in 
the welfare state mould the class structure. Two studies are important in this 
respect. The first was Gösta Esping-Andersen’s path-making comparison 
(1980) of class, politics, and welfare policies in Denmark and Sweden. The 
other is Matti Alestalo’s study of Finland (1986) with the illuminating title 
Structural Change, Classes and the State. Finland in an Historical and 
Comparative Perspective. 

As can be seen, the traditional tie between studies of social stratification 
and political sociology has in recent years been rejuvenated and strengthened. 
A leading center for comparative sociology in Scandinavia has been the 
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University of Bergen with its Center for Comparative Politics founded by 
Stein Rokkan. One of his creations was the useful and extensively used 
international ecological data archive at the University of Bergen (Brosveet et 
al 1981). Before his death in 1979 Stein Rokkan constructed what he called “a 
conceptual map of Europe,” a theoretically and empirically grounded multi- 
dimensional typology of the territorial structuring of western Europe (Rokkan 
& Urwin 1983). Rokkan’s work was indeed multidisciplinary, using elements 
from sociology, political science, economic history, geography, and linguis- 
tics. 

The nation-building approach has been of special interest to Norwegian and 
Finnish researchers, probably because Finland and Norway are among the 
newer European nations. The usefulness of a comparative approach in studies 
of a single nation is excellently demonstrated by Risto Alapuro’s (1988) 
recent book State and Revolution in Finland. A useful textbook about an- 
alyses and theories of national development was presented by Øyvind Østerud 
as early as 1978. The increasing tendency to interdisciplinary work and the 
use of ideas from all the social sciences have also led to critical assessments of 
the present mixture of welfare research and political sociology. In an impor- 
tant critique of the present welfare research, Göran Therborn (1987) presents 
an interesting and unorthodox mixture of ideas from Marxism and the public 
choice school. 


Women’s Studies 


As in many other countries, in the last decade the study of women in the 
Scandinavian sociologies developed into a very active field with a large 
number of researchers. Students who specialize in studying women in society 
and problems related to gender are among the most motivated of sociology 
students. One striking feature about the field is that all the leading researchers 
are women. This was true as well for the first generation of scholars in the 
field such as Harriet Holter (1970) in Norway, Elina Haavio-Mannila in 
Finland (Haavio-Mannila & Dahlerup 1985), and Rita Liljeström (1982) in 
Sweden, as well as for the young graduate students of today. Representatives 
for women studies have often maintained that male sociologists are not 
interested in the approaches and the findings from studies of women. Now, a 
change of attitude appears underway, undoubtedly because women’s studies 
and the sociology of gender have produced results and introduced approaches 
of general importance. 

In the Scandinavian countries women play an increasingly important part in 
the labor market—in Finland, for instance, 47.8% of the labor force in 1985 
was women-—and it is natural that the study of women is of great importance 
in all studies of working life. Segregation according to sex is also increasingly 
observed in the methodology of studies of social stratification. The study of 
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equality related to sex has influenced formulations of problems within the 
sociology of law. The use of special forms of depth interviews in the field of 
women studies has also had a positive effect on the development of qualitative 
methodology (Swedberg 1988). With the rise of women’s studies, new 
insights have been gained about certain forms of social policy and community 
care (Waerness 1987). 

Women’s studies have been both active and actively supported in Norway 
in particular. A Secretariat for Women Studies was established at the Norwe- 
gian National Science Foundation (NAVF). These studies have been pre- 
sented in an impressive series of 18 volumes. One of its present large-scale 
projects is to develop a program for comparative and intercultural women’s 
studies. 


Cultural Sociology 


During the 1980s there has been upsurge of interest in French structuralism, 
semiotics, and linguistic analysis. Foucalt, Derrida, and Baudrillard have 
been popular exemplars. Yet, the interest in the new French semiotics has 
been visible mainly among the youngest generation of sociologists and gradu- 
ate students, and it has as yet produced hardly any weighty research results. 
An increase in the interest in cultural analysis, however, has been visible in all 
the Scandinavian countries. The importance of cultural dispositions and 
culturally determined interpretations of social forms has been stressed in 
many fields of study. A recent source of inspiration has come from the French 
sociologist, Pierre Bourdieu (Sulkunen 1982). Among other things he has 
inspired analyses of the strong emphasis on higher education in present-day 
Scandinavian societies (Gesser 1985; 274—79) and also studies of the rise of a 
new middle class (Roos & Rahkonen 1985). 

Some of the sociologically most interesting cultural analyses have been 
produced by a group of Swedish ethnologists focusing their interest on how 
dominant cultural forms develop and how they relate to the class structure in 
society. Jonas Frykman and Orvar Löfgren (1979) have sharply and wittily 
analyzed the rise of the bourgeois culture during the nineteenth century, and 
its effects on commonly accepted views and perceptions. 

With the increased interest in cultural analysis, there has also been an 
increased emphasis on qualitative methodology. Particularly in the 1970s and 
the first years of the 1980s, qualitative methods enjoyed a strong popularity 
among students in particular. However, reports from several sociology de- 
partments in the Scandinavian countries indicate that the student preferences 
now during the second half of the 1980s are turning in another direction. With 
the increasing use of computers in both school and leisure life by teenagers 
and children, the new generations of students are said to prefer quantitative 
methods more strongly than did the immediately preceding generations. 
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Abstract 


Recent scholars have drawn upon the insights of Marx and Weber in a 
renewed effort to explain the origins of capitalism in Western Europe. Few 
Weberians or Marxists have addressed the specific role of Protestantism in 
fostering rational economic action; instead they speak of modernization or of 
the rise of the West. Marxists are divided over whether capitalism developed 
out of conflicts among classes in feudal society or whether an external market 
sector served to undermine feudalism and to stimulate new forms of produc- 
tion. Analyses of the world system, proto-industry, and the seventeenth 
century crisis attempt to explain the concentration of capital and of production 
in a few Western European countries. Studies of agrarian class conflict and of 
absolutism address the formation of the bourgeoisie. The most valuable recent 
syntheses have come from scholars who combine class analysis with an 
examination of the particular interests of those actors who inhabited the 
complex of institutions that cohered into nation-states. 


INTRODUCTION 


Social scientists and historians in recent years have renewed their attention to 
a question that Marx and Weber both considered central to their studies of 
modern society: How and why did capitalism first develop in Western Eu- 
rope? Debates among Marxists and with scholars inspired by Weber have 
spurred much useful historical research and analysis. However, the most 
powerful insights have come from scholars who seek to synthesize Marxist 
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and Weberian approaches, most notably in works that relate capitalism to the 
political development of nation-states. 

This article begins by presenting the chief lines of analysis that have 
developed out of Marx’s and Weber’s treatments of the origins of capitalism. 
Debates among adherents of the various schools are animated by their differ- 
ent understandings of how Marx and Weber formulated the problem of 
capitalist origins. These differences are expressed by the ways in which they 
employ Marx’s and Weber’s vocabularies to define capitalism and the state. 
In critiquing work within and across the different Marxist and Weberian 
schools, this article evaluates the uses and limitations of each definition. 

Proponents of each school define capitalism, and therefore the problem of 
capitalist origins, in somewhat different ways. Within Marxism, the principal 
line of division was highlighted in the debate between Maurice Dobb, who 
views capitalism as a relation between a property-owning bourgeoisie and a 
laboring proletariat (1947: 11-15), and Paul Sweezy, who defines capitalism 
as production for the market ([1950] 1976). Weberians are closer to Sweezy 
than to Dobb in that they analyze capitalism as a practice (of rational 
economic action) as well as in structural terms. Scholars from all these 
perspectives make a point of differentiating capitalism in general from the 
specific techniques, scale, and processes associated with the more recent 
specific form of industrial capitalism. In their emphases on origins, the 
scholars reviewed here look for the first indications of those characteristics of 
capitalism which they contrast with the defining elements of feudal or tradi- 
tional society: economic vs extra~economic surplus extraction, proletarian vs 
serf labor, production for the market vs production for use, world or national 
economic units vs manorial or “natural” economies, rational vs traditional 
economic action, individual vs collective property and economic interest, and 
nation-state vs feudal political organization. Theorists from the various per- 
spectives are in greater agreement on the definition of the nation-state. All 
would agree with Tilly that a nation-state, at a minimum, “controlled a 
well-defined, continuous territory . . . was relatively centralized . . . was 
differentiated from other organizations . . . [and] reinforced its claims through 
a tendency to acquire a monopoly over the concentrated means of physical 
coercion within its territory” (1975:27). 

The concern with origins has served to center the historical debate, and 
hence the emphasis in this article, upon the major nations of Western Europe, 
principally England and France, and to a lesser extent the Netherlands and 
Italian and German city-states. Eastern Europe is compared in broad terms, by 
Marxists in particular, to the more advanced capitalism of England and 
France. Regions of Europe not controlled by centralized states and where 
large landlords were not the dominant political power are slighted. The 
extensive research done on those areas in Europe where capitalism developed 
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relatively late addresses somewhat different theoretical problems and would 
need to be the subject of another review essay. (Chirot (1976) offers a strong 
critique of efforts to apply western European models of capitalistic develop- 
ment to Eastern Europe.) 


WEBERIAN APPROACHES TO THE ORIGINS OF 
CAPITALIST RATIONALITY 


Weber ([1922] 1978:1086) viewed feudalism as “inelastic . . . a chronic 
condition” which did not contain within itself the elements necessary to the 
development of rational economic technique nor to the practice of rational 
economic action. As a result, Weber looked to sources external to feudalism 
for the origins of capitalism. Surprisingly, Weber’s most famous discussion 
of the origins of capitalism, his theory of the Protestant ethic, has provoked 
almost as little recent scholarship from non-Marxists as it has from Marxists. 
Elaborators of Weber’s work on the Reformation have been more concerned 
with how political and economic forces affected Protestant doctrine than with 
whether Protestant belief contributed to capitalist action. Schluchter (1981) 
has sparked a revival of efforts to interpret all of Weber’s writings on world 
religions as part of a unified developmental model that culminates in Protes- 
tant “world mastery.” Yet, Schluchter does not attempt, as Weber himself 
did, to draw temporal, spatial, or causal links between Protestant belief and 
capitalist action. 

Most contemporary scholars who draw upon Weber’s ideological analyses, 
rather than his institutional studies of class and state, have attenuated Weber’s 
precise and careful contrast between rational economic technique and rational 
economic action, and his hypothesis about the role of Protestantism in foster- 
ing the latter: they have turned instead to broad generalizations about mod- 
ermization. Eisenstadt (1968a:7~-8) advocates “a shift of attention from the 
allegedly direct, causal relationship between Protestantism and capitalism . . . 
to the internal transformative capacities of Protestantism and to their impact 
on the transformation of the modern world.” 

While Eisenstadt (1968b) addresses modernization in general and looks for 
the functional equivalents to Protestantism in non-European settings, John A. 
Hall (1985) seeks to specify the role of Christianity in making Western 
Europe the first site of modernity. Hall places the beginning of this process in 
the ninth century, seeing that as the moment when technological innovation 
and market exchange began to differentiate Western Europe from the rest of 
the “ancient world.” Hall sees the decline of the western Roman Empire, and 
the consequent creation of strong corporate bodies of feudal landlords and free 
peasants, as crucial to the development of a modernizing Christian doctrine. 
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In contrast, where the eastern Roman Empire persevered, Christianity pro- 
vided a “Casesaropapist doctrine for Byzantium” (1985: 120). 

In Western Europe, according to Hall, Christianity justified each corporate 
body’s rights, thereby preventing lay or papal empires from weakening local 
autonomy. Instead, weak states ensured a multiplicity of political centers 
while Christianity provided a common culture to facilitate trade among the 
otherwise independent and isolated regions and locales. The same Christian 
doctrine also encouraged the weakening of kin ties [a development Weber 
(1978 : 1243—44) attributes to urban living], which in turn modified population 
increase, allowing peasants to enjoy increasing per capita incomes from the 
rise of trade and from technological advances. The stalemate among political 
bodies forced lords and peasants alike to invest in land and commerce, rather 
than in empire. Thus, for Hall, Western European dynamism is due to “the 
curious concatenation of circumstances:” (1985: 111) the plurality of classes, 
markets, and states given legitimacy and continuity by Christian doctrine. 

Goldstone (1987) criticizes Hall—and indeed the entire Weberian canon on 
the origins of capitalism—for confusing entrepreneurship and rational 
calculation with innovation and risk-taking, which Goldstone believes to have 
been the true engines of economic growth. Goldstone identifies eighteenth 
century England as the time and place when technological innovation acceler- 
ated. He argues that England’s uniqueness stemmed from the state’s toleration 
for ideological diversity, a toleration which was the only significant legacy of 
the Revolution and Civil War. In contrast, the Spanish state and those of 
Ottoman Turkey and China emerged from their seventeenth century crises 
with renewed efforts to enforce ideological orthodoxy. The Spanish, Otto- 
man, and Chinese empires survived for two more centuries, although with 
economies that stagnated as subjects avoided risk and thereby stifled innova- 
tion. 

The most compelling defense of Weber is presented by Collins, who 
contends that in the General Economic Theory, “Protestantism is only the last 
intensification of one of the chains of factors leading to rational capitalism” 
(1986: 33). As a result, Collins sees the Italian Renaissance cities as sites of 
capitalist development, especially after the fourteenth century popular rebel- 
lions “which replaced the charismatic law of the older patrician class with the 
universalistic and “rationally instituted” law upon which so much of the . 
institutional development of law was to depend” (1986:41). According to 
Collins, the Weber of the General Economic Theory attributes the decline of 
the Italian city-states relative to England more to the advantage which nation- 
states enjoyed in the competition of the world market than to the added 
capitalist fervor induced by the Protestant Ethic. 

Collins uses the development of legal and other aspects of rationality to 
explain Renaissance capitalism, while calling upon other institutional and 
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religious developments to explain the northern Europeans’ later advantages. 
For Collins, institutional factors matter mainly to the extent that they propel 
individuals to rational economic action. Collins, as he himself points out, is 
close to Wallerstein in his (1974, 1980) world system model (discussed 
below) in pointing to foreign competition, rather than to the class dynamics 
internal to cities or to nation-states, to explain the shift of economic power 
from Catholic Italy to Protestant Holland and England. Collins’s convergence 
with Wallerstein is possible because both view capitalism as a practice 
defined, for Collins in part and for Wallerstein in full, by the pursuit of profit 
through exchange. Both emphasize national politico-economic units, rather 
than classes, as the crucial elements in capitalist social relations. 

Chirot also sets the development of capitalism within a longer process 
of institutional rationalization. Chirot, like Hall, believes “the rise of the 
West . . . is so situational, so bound by the context of a given time, that it 
offers no lessons for the present” (1985: 193). Chirot views the long-term 
stalemate among classes and between clergy and laity in Europe since the 
eleventh century as creating an interest among all actors in fostering legal and 
religious rationalization. Such rationality, combined with the autonomy cities 
were able to win from stymied rural lords, created an environment for rational 
economic action that “even before the Reformation . . . was an important way 
of looking at the world in key urban circles” (1985: 191). 

An even more extreme example of this tendency to locate the sources of 
capitalist rationality prior to the Reformation is provided by Macfarlane who 
argues that the entire debate over the origins of capitalism is invalid for 
England where there was an individualist ideology and therefore “a developed 
market and mobility of labour, land was treated as a commodity and full 
private ownership was established, there was very considerable geographical 
and social mobility, a complete distinction between farm and family existed, 
and rational accounting and the profit motive were widespread,” at least since 
the thirteenth century (1978:195). Macfarlane equates those characteristics 
with capitalism and concludes that England was as capitalist in Marx’s and 
Weber’s senses of that word “in 1250 as it was in 1550 or 1750” (1978 : 195). 
England, therefore, was uniquely positioned to take advantage of the tech- 
nological advances and colonial opportunities that emerged in the late eigh- 
teenth century. 

Macfarlane’s work is criticized for its almost total reliance upon “tax 
records and parish registers [which] leave so many things out” (Stone 
1979:40). Macfarlane fails to understand that before the sixteenth century 
Englishmen were selling or exchanging villeinage rights to work land and not 
actual private property. “He totally ignores the close communal control, 
through the manorial court, of almost every aspect of the use of property 
[including] so many aspects of personal life that it is difficult to see where in 
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the medieval village the concept of individualism found room to flourish 
outside the one sphere Macfarlane emphasizes: the power to sell or bequeath 
property” (Stone 1979:41). 

Collins, Hall, Goldstone, Chirot, and Macfarlane represent an advance 
upon Eisenstadt in that they ground institutional rationalization, modernizing 
Christianity, pluralism and innovation, or English individualism, in specific 
geographic and temporal conditions; they do not argue that such an ideology 
is reproducible in almost any place and time. However, their efforts to 
differentiate Europe, or England, from the rest of the world serve to downplay 
the complex causal importance Weber gives to the Reformation. Instead, with 
the exception of Goldstone, they present modernization as a long-term pro- 
cess that proceeded without major interruption in those places where the 
necessary geopolitical and ideological conditions existed. 

The common difficulties with Eisenstadt’s general model, and Collins’s, 
Chirot’s, and Hall’s historical models, have been flagged by Tilly in his 
(1975) critique of political development. Such models, Tilly writes, draw on 
an image of “political process which only became prominent in the nineteenth 
century [namely steady, orderly evolution] . . . Such a literature seems 
unlikely to yield statements about the conditions under which a given political 
structure will disintegrate, stagnate, combine with others, or transform itself 
into a variety which had never been seen before” (1975:615). Only Gold- 
stone, of the scholars reviewed above, is concerned with analyzing the causal 
relation between seventeenth century political crisis and capitalist develop- 
ment. Eisenstadt’s ungrounded, and Collins’s, Hall’s, and Chirot’s very 
long-term, models of development share an inability to explain the very 
differences in timing and form of capitalist development across Europe that 
were the objects of both Marx’s and Weber’s analyses. While Macfarlane 
offers a nationalist-cultural reason for England’s early capitalist development, 
he and Goldstone are unable to explain why some English individuals gained 
political or economic advantage over others, nor how that acvantage was 
applied to particular social relations in different eras. 

Collins, Chirot, Hall, and Macfarlane consider changes over very long 
periods of time, while Eisenstadt refers to cases from all continents and 
historical eras. As a result, all five avoid specifying causal, as opposed to 
conjunctural, relations between the various ideological, geographical, and 
institutional factors they highlight on the one hand and the development of 
modern or capitalist practices and relations on the other. The unwillingness of 
Chirot, Hall, and Eisenstadt to address Weber’s hypotheses (and Macfarlane’s 
facile dismissal of them) are especially regrettable considering the dearth of 
Weberians and Marxists who have elaborated or critiqued Weber’s Protestant 
ethic thesis. Collins’s focus upon the General Economic Theory provides a 
welcome bridge between Weberian and Marxist concerns, yet also signals an 


ORIGINS OF CAPITALISM 53 


abandonment of a needed empirical and theoretical dialogue with the Protes- 
tant ethic thesis. 

A model of historical scholarship informed by, yet critical of, Weber’s 
theoretical framework is offered by Walzer (1965) who contends that in 
England Puritan Protestant beliefs “led to a fearful demand for economic 
restriction (and political control) rather than to entrepreneurial activity as 
Weber described it” (1965:304). Puritans acted to undermine traditional 
practices; however, they and their noncapitalist economic vision were de- 
feated in the Civil War by liberal capitalists whose beliefs and sources of 
political victory remain unanalyzed by Walzer. “This, then, is the relation of 
Puritanism to the liberal world: it is perhaps one of historical preparation, but 
not at all of theoretical contribution” (1965 : 303). However, Walzer is unable 
to identify a set of political or institutional factors that could account for the 
Puritans’ combination of psychological successes and political defeats. Gold- 
stone (1987), as noted above, also sees Protestantism’s contribution to liberal- 
ism as indirect; even though Puritan ministers sought to enforce a new 
orthodoxy, they had the effect of forcing the state ‘to tolerate religious 
pluralism and thereby they stimulated cultural and technological innovation. 

Walzer’s thesis is opposed by Christopher Hill (1963, 1972), the only 
contemporary Marxist who has engaged in a sustained study of Protestant- 
ism’s role in the development of English capitalism. Hill argues that Prot- 
estantism gave rise ‘to a libertarian communist, as well as a repressive 
capitalist, ideology. Only after the English bourgeoisie triumphed politically 
over peasants and proletarians did Protestantism become a guide to action in 
an actually existing society. 

Gould (1987) uses a Parsonian vocabulary in an effort to integrate a 
Weberian sense of Puritanism with a Marxist analysis of modes of production. 
Gould argues that Puritanism became a guide to rational economic action only 
after the early development of what Gould calls a “manufacturing social 
formation” made such action efficacious. By manufacturing social formation, 
Gould means the employment of waged labor under the direction of capitalists 
and supervisors, as described by Marx in. Capital. Whereas the drive for 
capital accumulation is immanent in the economy of industrial capitalism, it is 
not in the manufacturing social formation, and the Protestant ethic provides 
that otherwise absent drive. Gould mistakenly argues that an arbitrary absolut- 
ist government limited capitalists in their ability to further their interests and 
to engage in the economic behaviors legitimated by their religion. In fact, in 
the century prior to 1640 English landlords were able to raise rents faster than 
prices and to dispossess peasants (Lachmann 1987: 100-141). Gould argues 
that the motivation for, and success of, the 1640 Revolution arose from the 
contradiction of economy and polity. Subsequent efforts, in the period 1641— 
1688, to further or to repeal revolutionary actions were constrained by the 
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varying ways in which the English used their religion to derive goals appropri- 
ate for action in the new institutional contexts created by previous revolution- 
ary actions. Gould contrasts the ways in which the Protestant ethic was 
applied variously by royalists, Parliamentarians, and radicals. He explains 
how the ultimately triumphant gentry interpreted their religious ethic to justify 
a limited monarchy, a strong Parliament, and local autonomy for property 
holders. 

The issues of Protestant ideology, which have been addressed by Walzer, 
Hill, and Gould, and critiqued by Goldstone, have not been at the center of 
debates on modernization, nor, as subsequent sections make clear, have they 
been key to debates on class and the state. Unfortunately, most recent 
scholarship in this area has not engaged the theoretical and empirical prob- 
lems posed by the interactions of beliefs and practices, and of ideology and 
social structure. 


THE TRANSITION FROM FEUDALISM TO CAPITALISM 


Marxist scholars frame the problem by asking whether the conflict of ele- 
ments within feudalism led to the demise of that mode of production and the 
formation of a new one, or if feudalism was a static social formation un- 
dermined by actors and processes ‘external to it. Attention has centered upon 
the changes in landlord-peasant class relations that occurred in the century 
following the Black Death of 1349. In much of Western Europe aristocratic 
control over peasant labor weakened as tenants won personal freedom and 
mobility, even if their economic obligations to landlords remained heavy. 
Most Eastern European peasants, by contrast, were not able to take advantage 
of the post-plague conditions of labor shortage, and instead they were reen- 
serfed by their lords. 

Maurice Dobb (1947) defines feudalism in terms of its relations of produc- 
tion and therefore views the demise of serfdom in England and France as a 
vital indicator that feudalism was beginning its decline in those two countries. 
Dobb argues that factors internal to feudalism, primarily peasant and 
aristocratic class solidarity (with the latter mainly circumscribed by landlords’ 
abilities to profitably employ serf and free labor in different ecological zones 
under particular demographic and technological regimes), determine when 
and where peasants gain the freedom to develop a “petty [commodity] mode 
of production” (1947:85) under the continuing political rule of the aris- 
tocracy. 

Dobb argues that agricultural and handicraft production were transformed 
most radically by those producers who were both excluded from aristocratic 
and guild privileges and free from feudal restrictions on the employment of 
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their labor and their property. However, producers’ abilities to accumulate 
capital and to transform the petty mode of production into genuine capitalism 
were limited by unfair competition from guilds and mercantile monopolies 
and from lords who continued to collect rent under the protection of the feudal 
state. Dobb argues that the bourgeoisie’s victory in the English Civil War 
freed the means of production from feudal controls, allowing “England . . . to 
accelerate enormously the growth of industrial capital in the next half- 
century—a growth surpassing that of other countries which as yet lacked any 
similar’ political upheaval” (1947: 176). 

Dobb’s contention, echoed by most Marxists, that capile first de- 
veloped in England has been challenged by champions of Italy and France. 
Cohen (1980) argues that Renaissance Italian merchants exhibited a commit- 
ment to profit through their use of rational business methods. However, 
Cohen conflates rational technique, which some fifteenth and sixteenth cen- 
tury Italians did practice, with rational economic action, which was not a 
characteristic of most Italian enterprises. Instead, Renaissance Italians pur- 
sued what Weber calls politically oriented capitalism, and they sought wind- 
fall profits through war and political alliances. 

Leon (1970) contends that during the eighteenth century the rates of 
economic growth in France exceeded those in England. Such data, even if 
correct, do not demonstrate that prior to 1700 France also underwent a 
transformation in the relations of production equivalent to that of sixteenth 
century England. Elsewhere (Lachmann 1987:16-17), I argue that capital- 
ism, if defined as private property in land and proletarianized labor, de- 
veloped in England in the century from 1536 to 1640, well before any other 
country. An equivalent agrarian transformation began in a few regions of 
France in the late seventeenth and eighteenth centuries but occurred in much 
of France only after 1789. 

Dobb measures the effect of the bourgeoisie’s ascension to political power 
with a clear, if standard, Marxist analysis of how English capitalist develop- 
ment accelerated after the Revolution, especially in comparison with those 
European countries still under aristocratic rule. However, the importance for 
economic development of which class held political power makes it vital for 
Dobb to explain how a small English bourgeoisie still in formation was able to 
overthrow the aristocracy more than a century before its French counterparts 
did. Dobb’s failure to clarify feudal political dynamics has had two con- 
sequences. Some Marxists have turned their attention to the problem of state 
formation. Their work is considered later. However, many Marxists have 
sought to analyze the development of capitalist markets as a process external 
to feudalism. That tendency has drawn upon the non-ideological aspects of 
Weber’s discussions of cities and markets and is addressed in the following 
section. 
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Exchange, World Markets, and Proto-Industry 


Marxist studies of the market have developed from Sweezy’s ({1950] 1976) 
critique of Dobb. Sweezy challenges Dobb’s archetypal Marxist vision of the 
transition to capitalism as the outcome of a struggle between two classes, each 
with a vested interest in a particular form of property and in particular 
relations of production. Sweezy is unconcerned with the dynamics of feudal- 
ism, which he believes neither fetters nor generates capitalist development. 
Instead, he describes feudalism as production for use and capitalism as 
production for the market. Thus, Sweezy argues that capitalism arose in an 
urban, market sector outside the control of feudal lords. Where Dobb sees 
capitalism as fettered by the class interests and the power of a feudal aristocra- 
cy, Sweezy argues that production for the market is inherently more profitable 
than realizing a surplus through feudal appropriation in a natural economy. 
Sweezy contends that “pre-capitalist commodity production” developed in the 
fourteenth through sixteenth centuries and was superseded by full-fledged 
capitalism thereafter, because efficient producers were able to outbid feudal 
rivals for the inputs of land, labor, and capital, regardless of the relative 
political power of aristocrats and merchants. 

Sweezy’s argument suffers from a degree of circularity. He is unable to 
explain why markets and cities were more developed in Italy and in north- 
western Europe than in the eastern and peripheral regions of the continent, 
except by referring to the earlier development of markets in those regions. 
Nor, if existing markets are the main predictor of future capitalist develop- 
ment, can Sweezy explain how the once backward market sector of England 
overtook the older mercantile cities in the seventeenth century. 

Ironically, North & Thomas (1973), who adopt the perspective of mod- 
ernization theory, offer a far more political explanation for England’s early 
market preeminence than does Sweezy who defines himself as a Marxist. 
North & Thomas analyze how the political interests of feudal classes and state 
rulers inhibited the development of private property and of efficient market 
organizations. By acting in their own particular interests, premodern Eu- 
ropeans elevated the search, negotiation, and enforcement costs of exchange, 
which kept the private rate of return from investment far below the social rate 
of return. In North & Thomas’s view, English entrepreneurs enjoyed a higher 
level of economic understanding and fortuitous political circumstances, which 
allowed them to demand from the state private property rights, and patents to 
protect their property in technical knowledge. These innovations increased the 
return to trade and production, sparking English industrial capitalism. 

While North & Thomas identify some of the necessary political causes of 
capitalism, they are unable to explain why the feudal opposition to private 
property was overcome in England and not elsewhere. A similar problem 
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befalls deVries (1976) and Wrigley (1987). DeVries offers a masterful history 
of the divergent paths of economic stagnation and development in Europe 
between 1600 and 1750. Like North & Thomas he attributes the different 
outcomes to government policies, yet does not analyze the sources of those 
policies. DeVries is especially convincing at showing how innovations in 
banking, putting-out, and local and transnational markets were critical to 
capitalist development. Wrigley traces the effects of London’s urbanization 
upon growth throughout England, and upon rationalizing shifts in marriage 
and childbearing decisions. Yet neither deVries nor Wrigley explains why 
those innovations were confined to particular nations, nor how some states 
pursued policies that served to organize markets on a national level. 

Wallerstein’s (1974, 1980) world system theory addresses a number of the 
shortcomings in the market approaches of Sweezy and the non-Marxists, 
while sharing their definition of feudalism as production for use. Wallerstein 
defines the bourgeoisie in terms of market opportunities. Bourgeois classes 
emerge where and when the technical means of production, communication, 
and transportation allow that class to take advantage of differences between 
the costs of production in one region and the price of goods in other areas of 
the world market (1974:36). Thus, the timing of technological innovation 
determines the moment of initial capitalist development in Wallerstein’s 
model. 

Wallerstein argues that capitalists can profit from exchange with noncapi- 
talist systems, e.g. by exploiting slaves in Africa or in the Americas, or 
through buying the products of serf labor in Eastern Europe. In Wallerstein’s 
model, slaves and serfs, and slave owners and feudal lords, become part of a 
capitalist system of production merely by taking their place at the bottom of 
commodity chains that end in mercantile cities. 

Wallerstein brings politics into the market model of capitalism by recogniz- 
ing that state power was needed to control the producers of commodities and 
to secure domination over the markets in which commodities were sold for 
profit. Thus, Wallerstein argues that the main advantage that the core coun- 
tries of the sixteenth century European world system enjoyed over the Italian 
city-states which had previously dominated European trade was the national 
form of their state, rather than any improvement in the forces or techniques of 
production they employed, or in the rationality of the ideology which moti- 
vated their economic actions. Strong nation-states were able to enact mercan- 
tilist policies that served to organize markets on a national basis, thereby 
directing demand for finished goods to favored producers in core countries. 
Wallerstein makes an important contribution in presenting the formation of 
national markets as a political process, even if he is unable to explain why 
particular states were able, or failed, to sustain mercantilist policies. He 
employs a multiplicity of factors—geographic location, ecology, and es- 


58 LACHMANN 


pecially the balance of class forces——to explain variations in the type and mix 
of agricultural and manufactured commodities produced across the regions 
and political units of Europe, and therefore the initial core, semi-peripheral, 
or peripheral position of each European nation. 

The rulers of each state gain access to certain levels of resources and must 
find allies and fend off opponents from among the particular array of class 
actors created by their nation’s position as the producer‘and purveyor of 
commodities in the world system. Wallerstein’s analysis is functional in that 
he assumes state and class actors have relatively little difficulty in adopting 
policies that allow them to find and maximize the return from their appropri- 
ate niche in the world system. Wallerstein’s discussions of “struggle in the 
core” for primacy in the world system (1980:74—125, 244-289) attribute 
shifts in the pecking order among*European nations to evolution in the overall 
size and structure of the world system. 

The weak role of class agency in the world system model can be gauged ‘by 
comparing Wallerstein’s discussions of the sixteenth and the seventeenth 
centuries. While, in his (1974) discussion of the formation of the world 
system in the sixteenth century, Wallerstein acknowledges that preexisting 
class interests: were critical to a nation’s entry into the world system, in his 
second (1980) volume on the seventeenth century consolidation of the Eu- 
ropean world economy, Wallerstein rarely recognizes the capacity of any 
class to successfully pursue interests at variance with those determined by the 
logic of its nation’s place in the world system. Already in the first volume, as 
Skocpol (1977) points out in her penetrating review essay, Wallerstein 
assumes rather than explains the existence of transnational markets and of 
market rationality. 

Research by students of “proto-industrialization” challenges Wallerstein’s 
pessimistic view of the possibilities for class agency within early modern 
European markets. Kriedte et al, following Tilly & Tilly (1971), define 
proto-industrialization as “rural . . . industrial mass production for in- 
terregional and international markets” (1981:6). In pointing to rural proto- 
industry as the crucial engine of capitalist development, Kriedte et al are 
modifying Weber’s (1978 : 1236-65) and Sweezy’s (1976) depictions of the 
city as a realm of freedom. Instead, Kriedte et al argue that while medieval 
cities helped peasants escape feudal bonds, as Sweezy contends, cities also 
blocked further growth of industry “because in the urban economy the supply 
of labour and materials was inelastic and was kept that way by the economic 
policies of the guilds” (1981:7). 

Kriedte et al offer a resolution to the Dobb-Sweezy debate by showing that 
the absence of feudal fetters, in some rural areas but not in towns, was a 
necessary precondition for proto-industry. Thus, Kriedte et al (1981) and 
Kriedte (1983) urge scholars to look at the specific interplay of demographic 
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regimes, geographical comparative advantages, and rural class relations 
which generated often tiny islands of proto-industry within otherwise feudal 
regions. States, and the world system’s expansion, undermined feudalism by 
increasing the overall demand for proto-industrial production; however, nei- 
ther national nor international structural development predicts where proto- 
industrialists could benefit from, and further, the disintegration of feudalism. 

Kriedte has a stricter definition of capitalism than does Sweezy or Waller- 
stein. For Kriedte it is not enough that commodities are sold on a market. The 
commodities must be produced with labor and capital that have been freed 
from feudal controls. The advantage of Kriedte’s definition and method of 
historical analysis over those of Sweezy and Wallerstein ‘becomes apparent 
when one addresses the conundrum of capitalist development, raised by Eric 
Hobsbawm ({1954] 1965). Hobsbawm points out that the economic crisis that 
affected all of Europe in the seventeenth century forces one to ask “Why did 
the expansion of the later fifteenth and sixteenth centuries not lead straight 
into the epoch of the eighteenth and nineteenth century Industrial Revolution? 
What, in other words, were the obstacles in the way of capitalist expansion?” 
(1965:14). To redirect Hobsbawm’s question toward Wallerstein’s later 
work, we might ask why the initial beneficiaries of unequal exchange in the 
world market, the Italian city-states, Spain, Portugal and the Netherlands, 
couldn’t convert their profits into capital for investment in industrial produc- 
tion? g 

Hobsbawm answers his questions and ours by pointing out that “So long as 
there is no large body of wage-workers; so long as most men supply their 
needs from their own production or by exchange in the multiplicity of more or 
less autarkic local markets which exist even in primitive societies, there is.a 
limit to the horizon of capitalist profit and very little incentive to undertake 
what we may loosely call mass production, the basis of capitalist industrial 
production” (1965: 15). Where such a transition from feudalism to capitalism 
did not occur, as in Italy, Spain, and Portugal, one got what Weber 
(1978: 164-66, 193-201, 1961:246-47) calls “politically-oriented capital- 
ism,” an orientation to the profit opportunities obtained through political 
domination, predatory activity, and irregular transactions with political bod- 
ies. Those were the only sorts of profits the original core countries could make 
from exploiting slaves in Africa and in the Americas or by buying the 
products of serf labor in Eastern Europe. 

Only after proletarianization created a mass market could capitalists make 
continuous profits from goods produced in the periphery alongside the far 
more profitable market for mass-produced goods created by the proletarian 
workers themselves. While Hobsbawm identifies proletarianization and the 
mass market as the key preconditions for capitalist expansion, his essay is too 
brief to suggest an adequate framework for finding the processes through 
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which, and the sites where, capital was concentrated. Such a study can begin 
on the micro level proposed by the students of proto-industry. But it must be 
complemented, as Hobsbawm suggests, by identifying the factors that de- 
termined who gained and who lost control over capital during the seventeenth 
century crisis. 


Feudal Class Conflict and Capitalist Origins 


A renewed interest in the dynamic of feudal class conflict has yielded several 
suggestive answers to the questions of how a bourgeoisie emerged first in 
England and why that class used its power to foster mass production and to 
dominate international markets. Barrington Moore, Jr. (1966) looks to agrari- 
an class struggle to explain the different political, and by implication eco- 
nomic, systems created in England, France, and Germany, and in more recent 
non-European cases. Moore identifies the relative weakness of the English 
ruling class—with neither a strong crown as in France, nor a powerful 
aristocracy as in Germany—as the reason for that class’s inability to sustain 
feudal surplus extraction. 

Moore argues that the demise of feudalism opened the way for yeomen, 
drawn from the ranks of landlords and from the wealthier strata of peasants, to 
take advantage of international market demand for wool by converting grain 
lands to pasture. Moore does not explain why sixteenth century peasants 
failed to resist their proletarianization, nor does he acknowledge that peasants 
continued to lose their leaseholds even after the market for wool declined. 
Moore believes that commodity markets “generated a powerful stimulus to the 
growth of commercial and even capitalist outlook in the countryside” 
(1966: 6). However, Moore merely presumes that those factors led to capital- 
ist interests and practices without explaining why. 

Robert Brenner (1976, 1982) defines feudalism and capitalism in terms of 
relations of production. In contradistinction to what he (1977) calls “neo- 
Smithian Marxists” who stress market relations, Brenner sees the relative 
class strength of peasants and landlords as the key variable. He argues that 
lords everywhere sought to reenserf peasants following the Black Death. In 
Western Europe, where peasant community solidarity was strong, landlords 
failed; in Eastern Europe isolated peasants were unable to enforce their svong 
formal land rights and prevent imposition of a second serfdom. 

By focusing upon the internal class dynamics of each country, Brenner is 
able to refute Wallerstein and Sweezy’s assumption that feudal production 
and class relations remained static until penetrated by external forces. For 
Eastern Europe, Brenner draws a clear causal line from class struggle to the 
second serfdom and to that region’s backward export economy. The ability of 
the Eastern European lords to constrict the mobility and income of their 
tenants eliminated the labor supply and home market for industry; their 
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reliance upon forced labor arrested the development of productivity and skill 
on Eastern European estates centuries before the formation of a world market. 

Brenner argues that the social-property systems formed out of the post- 
plague class struggles determined the form of “ruling-class self-organization,” 
i.e. of the state (1982: 69), as well as the direction of economic development. 
Here Brenner concurs with Perry Anderson who states that “one of the basic 
axioms of historical materialism [is] that the secular struggle between classes 
is ultimately resolved at the political—not at the economic or cultural—level 
of society” (1974: 11). Both agree that differences in the form and develop- 
ment of Eastern and Western. European, and of the French and English, 
absolutist states in turn affected subsequent economic development and class 
formation. 

Powerful nobles in Eastern Europe subordinated independent cities (here 
Brenner and Anderson disagree with Sweezy and Wallerstein as well as with 
Chirot and Hall in seeing weak cities as consequences rather than causes of 
the outcomes of agrarian class struggle) and starved monarchs of revenues. 
After monarchs and urban elites had lost their autonomy and peasants were 
reenserfed, the principal challenge to Eastern European seigneurs came from 
other nobles who attempted to conquer rivals’ territories as the only remaining 
way to expand their incomes. As a result, Eastern nobles quickly consolidated 
themselves into ever larger geographical units (Brenner 1982: 38-41, 75-76). 
Anderson, like Moore, argues that the continuing threat of peasant rebellion 
was an even more vital reason for Eastern European nobles to-unite in strong 
States. 

The differing outcomes of postplague class struggles created’ more auspi- 
cious conditions for Western European kings to build the sorts of autonomous 
absolutisms which Anderson believes fostered bourgeois class formation. 
Most critically, peasants retained personal freedoms and land rights. Kings 
could protect and tax peasant farms, and this served to limit noble resources 
while providing the state with revenues to build armies and bureaucracies 
independent of their nobles. Autonomous towns, which survived the failed 
seigneurial offensives, offered further allies and resources for the construction 
of absolutist states autonomous from feudal lords. The contrast between 
Eastern and Western European absolutism is apparent from the development 
of noble estates. The estates retained control of the state in the east; in the 
west they steadily lost legal and fiscal powers as nobles were reduced to 
officeholders and pensioners dependent upon state favor. 

Anderson and Brenner believe that fourteenth-century class conflicts set 
Eastern and Western Europe on different, though unilinear, paths of state 
formation and therefore of economic development. The difficulties with their 
formulations emerge when they try to explain how a bourgeoisie formed 
under absolutism and why capitalism developed sooner in England than in 
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France. Brenner argues that France failed to develop economically in the 
sixteenth through eighteenth centuries because state protection of peasant 
tenures prevented property accumulation while encouraging population in- 
crease. Surplus extraction was increasingly carried out through taxes which 
fed the military and bureaucracy rather than economic projects (1982: 76-83, 
91-94). 

Brenner correctly contends that the decentralization of power among En- 
glish magnates, lesser nobles, and the crown prevented the development of a 
strong absolutism similar to that of France in the fifteenth and subsequent 
centuries. A weak state precluded English nobles from squeezing more taxes 
from peasants and ensured their defeat in foreign wars. The English aristocra- 
cy then turned to a “zero-sum game” of attacking one another in civil war. 
When that failed, English landlords used “their continuing political control 
over land [which] proved to be their trump card” to dispossess the peasantry 
(Brenner 1982:84). Brenner fails to explain how the English aristocracy in 
the sixteenth century suddenly came to understand that private property in 
land and proletarian labor would lead to “specialization and improvement . . . 
an agricultural transformation” (1982:88). Nor does he explain why English 
peasants, who had turned back a seigneurial offensive in the fourteenth 
century, fell victim to a land grab by a landlord class which Brenner views as 
even less well-organized in the sixteenth century. 

Anderson differs from Brenner in his belief. that English and French 
absolutism both aided capitalist development in similar ways: merchants 
accumulated capital by investing in state offices and monopolies and by 
selling goods to the state; and the state’s interest in reviving Roman law (or, in 
England, in codifying common law) for its own administrative purposes 
created a mechanism the bourgeois could use to guard their private property 
rights against appropriation by aristocrats or by the state..On this latter point 
Anderson is in accord with Weber (1978 : 809-10) and Chirot (1985) that the 
law contributes to general capitalist rationality beyond its specific origins in 
and use for guarding the particular interests of state officials or of an eco- 
nomic class. 

Anderson’s analysis is a clear advance upon earlier Marxist scholars (e.g. 
Hill 1967) who presume that classes develop on their own in the realm of 
production and that the state merely reflects the balance of class power in civil 
society. By distinguishing between mode of production and social formation, 
Anderson provides a mechanism through which absolutism can undermine the 
interests of the aristocratic class which formed and staffed it. As a result, 
Anderson is suggesting that bourgeois class formation was an unintended 
effect of absolutism, rather than claiming that a rising bourgeoisie gave the 
absolutist state a relative autonomy from and power over the aristocracy. 
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Advocates of that latter view—which began with Engels ({1884] 1972) and 
was elaborated by Dobb (1947:161-76), Porchnev (1963), Lublinskaya 
(1968), and in a somewhat different way by Poulantzas (1975)—-suffer from 
an inability to identify instances of bourgeois-state alliances against the 
aristocracy. Nor do they offer a mechanism through which the bourgeoisie 
gained independent power. Those failures allow Trevor-Roper ({1959] 1965) 
to argue that seventeenth century antistate rebellions were neither bourgeois 
revolutions, as Hill claims for England, nor aristocratic reactions, as argued 
by relative autonomy theorists. Instead, Trevor-Roper mistakenly describes 
such rebellions as.no more than conflicts between a “court” composed of 
self-interested state officials and the “country”, which was Torea to bear the 
increasing burden of taxes. 

Anderson suggests that the lines of seventeenth century conflict can be 
analyzed in terms of the particular groups of aristocrats advantaged and 
harmed by each country’s absolutism in successive historical eras. Un- 
fortunately, Anderson does not undertake such an analysis himself. Nor is his 
model of absolutism sufficiently developed to spėcify the state’s different 
effects upon bourgeois class formation in England and France; to do so would 
close the gap in Brenner’s argument by explaining how the capacities and 
interests of English landlords were transformed in the sixteenth century. 
However, Anderson’s work focuses attention upon absolutism as the political 
institutionalization of evolving feudal class relations rather than as merely the 
instrument of an aristocratic class. His effort builds the foundation for future 
work which could explain the discrepancies, highlighted by critics of Marx- 
ism, between revolutionary actors and their purported class interests by 
looking at the inadvertent structural effects of the state as social formation. 

John Merrington’s (1978) analysis of the role of towns in the transition to 
capitalism suffers from a problem similar to that of Anderson’s study of 
absolutism. Like Anderson, Merrington argues that the Dobb-Sweezy debate 
falsely dicotomizes the relationship of cities and feudalism. Merrington views 
cities and their development as internal to feudalism yet ultimately creating a 
bourgeoisie with interests opposed to those of the feudal ruling class. Thus, 
for Merrington, cities eventually become the site of a revolutionary challenge 
to feudalism. However, Merrington’s and Anderson’s schema fail to identify 
a point when, or a reason why, a bourgeoisie that thrived within feudalism 
and through absolutism came to oppose their aristocratic customers and 
sponsors. 

Gintis & Bowles (1984) provide a bridge to Weber’s own (1978: 901- 
1110) analysis of the state by highlighting the economic consequences of a 
developing absolutist state bureaucracy. They argue that once political power 
was concentrated in state structures, the essential character of feudalism—the 
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fusion of political and economic powers in manors—was lost. Thus, absolut- 
ism, in Gintis & Bowles’s view was a new articulation of state and economy: 
“If the fundamental form of surplus extraction is no longer feudal, the state 
cannot be considered an agent of feudal reproduction, whatever its role in 
sustaining the preeminence of aristocratic families” (1984: 40). 

Gintis & Bowles are better able than Anderson to articulate why the 
aristocrats who manned absolutist states undermined their own interests. In 
the course of building state revenues for the purpose of protecting class 
privileges, state managers inadvertently weakened seigneurial powers and 
Strengthened peasant and town legal rights. However, while Gintis & Bowles 
describe the general effects of absolutism, they cannot pinpoint the reasons 
for England’s divergence from that pattern: why England developed capital- 
ism instead of prolonging the transitional articulation Gintis & Bowles iden- 
tify elsewhere in Europe. 


STATE AUTONOMY AND CAPITALIST DEVELOPMENT 


Much of the most useful research on the relationship between state formation 
and capitalist development has come from scholars who doubt that absolutism 
was a state of, by, and for the aristocracy and instead make the Weberian 
assumption that states are run by self-interested and therefore autonomous 
officials. Otto Hintze ({1902—1906] 1975) and Frederic Lane (1958, 1979) 
have been influential in developing this line of analysis. Hintze sees the state 
as primarily a military organization. State power is based on the ability to 
draw manpower and other resources for war from the country at large. The 
transition from the feudal to the modern military state is a result of the state’s 
ability to conscript men directly from the national population. This serves to 
undercut the position of the nobility and strengthens that of an autonomous 
state elite. Hintze is unclear as to why aristocracies agree, or how they are 
forced, to subordinate their class interests to the geopolitical needs of their 
nation, nor does his work provide a guide for understanding how modern 
military states contribute to capitalist development. 

Lane uses economic terms to analyze relations between states and their 
subjects. He traces the ways in which such political exchanges affect other 
types of economic relations. Lane argues that aristocrats and merchants alike 
need to buy protection from violent attack, and he defines states as organiza- 
tions that collect taxes for the expense of policing and maintaining a monopo- 
ly on force within a territory. The state benefits to the extent that it can collect 
taxes above the costs of providing protection services. Lane calls that profit 
“tribute.” Merchants make a form of profit Lane calls “protection rents,” 
which accrue when taxes are less expensive than losses from attack and are 
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lower than the protection costs incurred by rivals. Lane argues that “During 
the Middle Ages and early modern times protection rents . . . were a more 
important source of profits . . . than superiority in industrial techniques -or 
industrial organization” (1958 : 410). 

Lane offers a stage theory of history from 700 Ap to 1700 Ap based on the 
“changing relation of violence-using enterprises to the amount and distribu- 
tion of surplus” (1958:412). Initially, armed men established their own 
tribute-collecting enterprises. These feudal lords had little incentive to control 
costs. Only later when such lords were brought within larger state enterprises, 
and their land rent became separate from taxes, did kings have reason to try to 
control military costs and thereby to raise the royal share of surplus. Finally, 
the profits from trade and technical innovation came to exceed those from 
protection; only then did the customers (citizen voters) take control of states 
and their armed employees. 

McNeill (1982) contends that the European multistate system, in contrast to 
the Chinese empire, fostered trade and increased competitive demand for new 
weapons. McNeill traces how different nations gained continental, and later 
world, power from the use of new military technologies and strategies. Yet he 
is not able to explain why some countries failed to develop the bureaucracies 
and procapitalist policies needed to organize and pay for the means to 
compete in European wars. 

Lane’s suggestive model has been fleshed out for France, and recently for 
other European nations, by Charles Tilly. Tilly (1985) views European 
monarchs as Mafia chieftains who use their military might to threaten war 
upon peoples and territories unless they pay “protection” in the form of taxes. 
Taxes, in turn, provide the revenues to support ever larger military forces. For 
Tilly, and for Michael Mann (1980, 1986) who advances a similar argument 
for England, the monarch and his subordinate staff form a self-interested elite 
who build a state apparatus to augment their power and their share of the 
national income at the expense of nobles and peasants alike. 

A similar model is proposed by Levi (1983). She believes that rulers want 
to maximize their wealth and power. They are constrained in their ability to 
do so by the “terms of trade” between rulers and ruled, i.e. by subjects’ 
Capacities to resist rulers’ demands and by the costs incurred in trying to 
increase revenues, Thus, the form of the state and “differences in state 
policies will primarily reflect differences in constraints on the ruler” (Levi 
1983 : 223). The key constraints are the mode of production and class structure 
which determine subjects’ capacities to resist, previous constitutional arrange- 
ments between rulers and ruled, and the international pressures on the state. 
Levi believes that in the long run rulers increase their coercive power through 
the use of improved technologies for monitoring subjects and their own hired 
agents. 
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Levi tests her model with a comparison of taxation in England and France 
from the twelfth through seventeenth centuries (1988 : 95-121). She exposes 
the key “paradox [of that comparison]: The relatively weaker bargaining 
power of English monarchs vis-a-vis their constituents led to concessions that 
French monarchs did not have to make. However, the Parliament that evolved 
ultimately enhanced the ability of monarchs to tax” (1988:97). Levi argues 
that an almost permanent state of foreign war served to forge ideologies of 
nationhood and of the crown as the guardian of national interest. Acceptance 
of those ideologies fostered “quasi-voluntary compliance” in both countries, a 
compliance that was further increased in England because the crown’s 
negotiation with Parliament “enshrouds a ruler’s policies in legitimacy” 
(1988: 118). 

Levi is on shakier ground when she argues that English kings were forced 
into negotiation because “English landlords and capitalists were stronger than 
their French counterparts. Consequently, English monarchs had worse terms 
of trade with capitalists (some of whom were nobility) than did the monarchs 
of France” (1988:111). English capitalists gained that strength only in the 
sixteenth century, while the tax systems of the two countries were formed, as 
Levi points out, in the previous three centuries. Levi’s model misses the ways 
in which the tax and other policies of English monarchs led English landlords 
to define their interests in capitalist terms, and it lacks the specificity to 
explain how an initially weak French state implicated autonomous economic 
actors in its finances and eventually fostered countervailing capitalist in- 
terests. 

Tilly addresses that latter issue. He is concerned with explaining variations 
in the sorts of “bargains” states struck with their subjects to secure the 
resources needed for war, and the effects of those bargains upon the organiza- 
tion of production. The terms of bargains depended upon the relative power 
and resources of the state on the one hand and “newly conquered peoples .. . 
actual or potential rebels . . .. and lenders, taxpayers or sources of military 
wherewithal” on the other (Tilly 1987a: 11). Kings were forced to grant their 
most powerful subjects—armed nobles and long-established regional and 
corporate bodies—certain rights within, and a measure of control over, the 
state in return for access to the resources which they controlled. Thus, the 
most powerful subjects gained “citizenship” within their forming nations as 
they surrendered resources to the crown. 

Fhe extent to which rulers needed to make bargains with their subjects 
depended upon the costs of war, which rose inexorably over the centuries and 
escalated rapidly with technological innovations and with the development of 
mass conscription in the late eighteenth and nineteenth centuries. The sorts of 
bargains struck depended upon the relative concentration of coercion and 
capital in each political unit. “A ruler. . . in a relatively uncommercialized . 
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agrarian economy such as Russia had little choice but to enlist the support of 
landlords, to conscript peasants, and to tax property, a process that ordinarily 
created massive bureaucracies. A commercialized maritime state . . . could 
rely on customs and excise (which are relatively inexpensive to collect) and 
emphasize seapower without forming a vast central administration; the strat- 
egy could only work, however, with the close collaboration of merchants and 
financiers” (Tilly 1987b:3). Tilly argues that the initial opportunities open to 
a ruler determined the path of state formation and constrained later strategies 
for state formation. Maritime states, such as Venice and Genoa, that gained 
an early advantage from their ready access to capital were unable to build their 
coercive power and enlist large standing armies and therefore “lost out to the 
substantial states of the Atlantic,” such as England and France, which were 
able to conscript armies (Tilly 1987b:5). Thus, state power shaped class 
relations and determined the form of the dominant class’s access to in- 
ternational markets. 

Tilly’s analysis is complemented by Blockmans (1978), who traces the 
ways in which the holders of capital and armed force were organized in 
“representative institutions” in late medieval Europe. Blockmans argues that 
the ways in which monarchs recognized the rights of representative in- 
stitutions were shaped by the preexisting bases of power held by the members 
of such institutions as rural landlords or as urban corporate bodies. Institutions 
maintained or lost power as monarchs failed or succeeded in playing off rural 
and urban interests against one another. Representative institutions were 
divided and weakened when key members gained greater power and income 
from royal office than they did from upholding the group privileges of the 
members of each body. 

Blockmans identifies five main types of representative institutions: O 
assemblies of rural landlords which protect nobles’ fiscal interests but ex- 
ercise decreasing power over other state functions; (ii) assemblies combining 
rural and urban elites which are able over the long-term to intervene in a 
variety of issues and preserve their power; (iii) urban institutions under only 
nominal royal control; (iv) leagues of independent but isolated cities eventual- 
ly overwhelmed by surrounding nobles and monarchs; and (v) independent 
city-states racked by fights among factions often allied with foreign powers 
(1978 : 202-09). Blockmans’s scheme identifies the points of division within 
and between representative institutions and thereby offers a basis for com- 
parative analyses of monarchs’ efforts to defeat and incorporate formerly 
independent aristocracies and corporate bodies. Blockmans’s model provides 
one axis along which one can analyze the foundations for states’ “bargains” 
with organized groups of subjects, and the forms through which particular 
classes succeeded or failed in protecting their interests. 

Rokkan (1970: 72-144), while concerned with the establishment of multi- 
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party democracies rather than with the origins of capitalism per se, is close to 
Blockmans in arguing that the timing of territorial consolidation during the 
Middle Ages, and the form of state which emerged, are determined by the 
strengths and interrelations among formerly autonomous representative in- 
stitutions. Rokkan differentiates nations in terms of the continuity of medieval 
representative bodies into the modern era, the extent of political centraliza- 
tion, and the timing of the emergence of new bourgeois and proletarian 
polities. Rokkan argues that the effect of those factors upon later political (and 
economic) development was set during three “critical junctures:” the state- 
church conflicts following the Reformation, the “national revolutions” which 
established modern states, and the era of industrialization which established 
the modern class character of each country (1970: 112-20). 

Tilly in his study of The Contentious French (1986) has given theoretical 
depth and historical substance to the process outlined by Blockmans. Tilly 
argues that as once independent landlords were defeated and/or incorporated 
within absolutism as office holders, and as taxes grew in importance in 
relation to rents during the seventeenth century, state officials and taxpayers 
replaced lords and peasants as the principal antagonists in rural conflicts 
(1986: 119-61). Tilly traces the ways in which the state’s appropriation of 
resources and redefinition of interests transformed the forms of contention as 
well as the opportunities for elite and popular mobilization. 

Tilly shows how the state’s appropriation and expenditure of capital aided 
an ongoing process of capitalist development. State taxation concentrated 
resources at a national level, creating (through military procurement and the 
demand for luxury goods) the initial markets for capitalist enterprises. More 
important in Tilly’s historiography is the effect of state tax demands upon the 
French agrarian economy. Rising taxes increased peasants’ need for cash, 
forcing them to sell a growing portion of their produce and often their labor in 
markets. When peasants could not realize enough cash they fell into debt, 
eventually losing their land in bankruptcy sales to the bourgeoisie. 

Through the dispossession of the peasantry, the French state was indirectly 
responsible for the freeing of land for capital and for the concomitant genesis 
of a labor market (Tilly 1981: 191-210; Fitch 1978: 194-206 makes a similar 
argument). Here Tilly offers an important qualification to Brenner’s conten- 
tion that the crown protected French peasants from dispossession and thereby 
retarded agrarian development. While Brenner is correct for the sixteenth 
century, Tilly accurately describes a shift in state strategy during the seven- 
teenth and eighteenth centuries away from upholding peasant land rights and 
toward allowing tax officials to use tax arrears to appropriate peasant lands. 

While Tilly has confined his analysis of state formation to France, Mann 
(1980) has attempted to develop a similar model for England, and more 
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recently (1986), to embed a discussion of state formation within a study of 
The Sources of Social Power, primarily in Europe, from the beginning of 
history to 1760. One difficulty Mann confronts is that since capitalism 
developed so much more rapidly in England than in France it is difficult for 
him to argue that the state’s fiscal and military demands played a major role in 
that transition. Instead, Mann views the English state as almost exclusively an 
international military actor which did not create a bourgeoisie, but merely 
“pushed classes toward a national form of organization” (1980:203). 


NEW APPROACHES TO FEUDAL STRUCTURE AND 
CAPITALIST AGENCY 


Mann (1986) at times echoes Hall (1985), as when he argues that Christianity 
aided market exchange by providing a common culture across Europe, but he 
does not explain the different effects of that religion in Eastern and Western 
Europe, and over time. Mann’s historiography remains deficient at various 
points, most notably in his failure (1986: 450—99) to explain why English 
monarchs were forced to make “constitutional” concessions to independent 
elites while French rulers were able to draw aristocrats and corporate bodies 
within an absolutist state. Mann neither identifies the sources for the differing 
capacities of the monarchs in the two countries, nor analyzes how royal 
strategies affected the internal composition of rival elites. However, the 
theoretical framework offered in The Sources of Social Power distills what is 
most fruitful in the work reviewed above. Mann suggests that sociologists 
must examine the total structure of relations among the institutions within 
which four principal forms of social power—economic, political, military and 
cultural—are embedded by their holders. Mann argues that social change 
occurs in the “interstices” of those institutions. 

Hobsbawm, Anderson, and Tilly, in their quite different historical analy- 
ses, share with Mann a view of feudal society as a complex of institutions. 
They do not look for a single class actor, nor for a single ideological, market, 
or military agent as the source of social transformation in early modern 
Europe. Instead, they are concerned with identifying the points where rela- 
tions among actors and institutions are transformed, allowing for the practice 
of a complex of new social relations of which capitalism is the foremost. 
Thus, Hobsbawm argues that mass production and industrial capitalism be- 
came possible only after the seventeenth century crisis concentrated capital 
and political power in the hands of a few, nationally based capitalists. 
Anderson sees absolutism’s role in the transition to capitalism as one of 
reorganizing the institutions and legal ideologies of feudalism, thereby open- 
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ing new sites for agency by bourgeois classes in formation. Tilly analyzes the 
state as more than the site of a self-interested elite. His state serves to define 
classes in national terms, while also reordering the sites of surplus extraction 
and of contention, and in the process defining the terrain upon which classes 
and market relations are formed. 

Giddens (1985) approaches the study of power by identifying the factors 
which concentrated surveillance, organizational administrators, the means of 
violence, and the means of ideology within states. Giddens approaches the 
transition from “traditional” to “absolutist” to “nation” states as the product of 
class and other actors’ efforts to contro] “power containers” and to enhance 
the efficacy of the containers (i.e. of the sites and institutions) they controlled. 
Giddens discusses how technological innovation advantaged those who con- 
trolled particular power centers. He does not attempt to explain how the 
concentration of power aided the development of capitalism. 

Elsewhere (Lachmann 1987, Lachmann & Adams 1988), I have argued 
that feudalism is defined by the division of power among three institutionally 
grounded elites: monarchs, clerics, and locally based manor lords. I view 
conflict among elites, as well as between those elites and the peasant class, as 
the source of social structural transformation. I try to demonstrate that the 
particular forms of absolutist, and post-revolutionary states, and of feudal and 
capitalist relations of productions are artifacts of the elite and class conflicts 
fought out in the previous structural contexts of states and economies. 

The relative salience of these and future approaches to the transition from 
feudalism to capitalism must still be determined. Much historical analysis 
remains to be done in order to explore the various theoretical directions 
opened by Marx and Weber, and by recent.efforts to synthesize their 
frameworks. At this moment it is fair to predict that scholars will advance 
further if they define capitalism as more than a set of rational practices which ` 
emerged gradually and were transferable across time and place. Scholars need 
to concentrate on explaining the development of new.class relations between 
property owners and wage laborers. The bourgeoisie gained the capacity for 
revolutionary action within feudalism and developed interests (which only in 
retrospect do we recognize as capitalist) in reaction to challenges from feudal 
elite and class conflict. The specific forms of capitalist class relations which 
emerged in different regions and nations of Western Europe were products of 
the opportunities opened to the bourgeois classes formed within feudal social 
formations. | 
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Abstract 


The purpose of this review is to assess the adequacy of various economic and 
sociological explanations in accounting for certain key features of change in 
large-scale corporations, including vertical integration, product related and 
unrelated diversification, and the implementation of the multidivisional form. 
We first review the various economic theories that purport to explain these 
phenomena, including the structure-conduct-performance perspective, the 
literature on managerial discretion; transaction cost analysis, contingency 
theory, and the evolutionary theory of economic change. All of these litera- 
tures have efficiency mechanisms that drive their explanations although they 
work in different ways. We then consider sociological approaches including 
the literatures in organizational theory on institutionalization and power. Our 
review of the empirical work provides little support for the economic view of 
vertical integration and unrelated product diversification, and only modest 
support for the economic view of product-related diversification and the 
multidivisional form. The sociological views aid in understanding all of these 
phenomena to some extent. We conclude by suggesting that the concept of 
organizational fields may be useful in understanding these structural changes. 


INTRODUCTION 


In the past, organizational theorists in sociology have generally shied away 
from explicit study of change in the strategies and structures of the profitmak- 
ing firm. Instead, sociologists have concentrated on change in nonprofit and 
public sector organizations, leaving the study of change in profitmaking firms 
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to intervention-oriented practitioners of organizational design. The promise of 
economic explanations, with their focus on market forces, to account for most 
of the important changes in firms has led to the view that sociological 
accounts of change in those organization are not likely to explain much 
additional variation (see for example, Meyer & Rowan 1977). The assumed 
priority of economic explanations has been particularly powerful when certain 
structural changes are at issue. These changes—vertical integration, di- 
versification, and divisionalization—are generally seen as rational responses 
to economic exigencies, and thus as immune to sociological explanation. 

This theoretical division of labor obscures several important points. First, 
there is no a priori reason why sociological theories of organizational change 
should not be applicable to profitmaking firms. Second, market processes, 
like exchange and pricing, may be amenable to at least partial explanation in 
sociological terms. Indeed, recent work has shown how the social structure of 
markets affects the patterns of interactions (Eccles 1985, Baker 1984). Final- 
ly, a concern with change points up the relevance to sociologists of a large 
body of work in industrial organization, business history, and institutional 
economics. This work forces organizational sociologists to sharpen their 
claims about what can be explained. | 

A unitary view of change in the profitmaking firm does not exist for either 
economics or sociology. Clusters of views can be broadly characterized, 
however. In doing so, we are less concerned with the disciplinary affiliations 
of authors than with the mechanisms they employ in their theories. For 
instance, population ecology theory, generally thought of as a sociological 
theory, is more compatible with economics than with sociology because of its 
focus on selection mechanisms in an environment largely defined in market 
terms (Hannan & Freeman 1977, 1984). 

Economic perspectives center around the requirement of efficient produc- 
tion imposed by production for a market. In all accounts that accept the basic 
neoclassical framework, actors who control organizations attempt to 
maximize profits (or growth) by optimizing on the relation between inputs and 
outputs. The neoclassical theory of the firm treats as unproblematic the 
strategy and structure of the firm within which this optimization problem is 
solved (Penrose 1959). Other approaches within economics examine the 
structure of the firm as an explicit part of the problem of maximization. Less 
orthodox views question the notion of maximization itself, arguing that 
organizational processes and human limitations. make the ability of organiza- 
tions to adopt optimal behaviors problematic. We examine five perspectives 
on this problem: the structure-conduct-performance perspective which has its 
roots in price theory (Stigler 1968, Caves 1982, Scherer 1980); models that 
focus on the problem of managerial discretion (Berle & Means 1932, Baumol 
1959, Marris 1964, Marris & Wood 1971, Williamson 1964, Leibenstein 
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1976); the various forms of transaction-cost analysis (Williamson 1975, 
Coase 1937, North 1981, Alchian & Demsetz 1972, Jensen & Meckling 
1976); work we will call rational adaptation theory, which views managers as 
problem solvers who attempt to align organizational structures with the 
requirements of various objectively existing, but not always easily un- 
derstood, market situations (Chandler 1962, 1977; Lawrence & Lorsch 1967; 
Thompson 1967); and the evolutionary theory of Nelson & Winter (1982). 

Sociological approaches are even more diffuse than economic approaches 
as they lack the efficiency principle as an intellectual anchor. Nonetheless, 
certain key features characterize such models. Most importantly, sociological 
models often call into question the close link between action and outcomes 
that underlies economic models, at the level both of managerial perception 
and of the operation of markets. For large firms, especially, the extent to 
which survival is affected by action, within broad limits, may be quite low. 
Market power, diversification, stores of resources, and political claims may 
combine to attenuate severely the disciplinary force of markets. Even given a 
strong connection between outcomes and actions, the ability of managers to 
evaluate the connection correctly may be low, opening up the possibility that 
managers operate in a world that is socially constructed rather than objectively 
accurate. This point, too, is corrosive ‘of notions of market forces shaping 
firms through consequences for survival, when it is realized that the market in 
fact consists‘of other firms similarly hampered in their information handling 
and analytic capabilities. From this perspective, successful change and 
adaptation may have little to do with efficiency. 

Sociological theories of change can be grouped into two categories (a) 
theories which broadly argue that behavior within firms is normatively gener- 
ated, or (b) theories that focus on the role of powerful actors in producing 
change and defining its limits. Within each of these traditions, attempts are 
made to specify whether the locus of change is internal or external to the 
organization. The Carnegie School views internally generated standard op- 
erating procedures as normative principles that guide action in a complex 
world and hence explain key features of organizational structure and change 
(March & Simon 1958, Simon 1957, Cyert & March 1963). More recent 
work stresses the role of institutional environments as the source of rule- 
governed behavior and change in organizations (Meyer & Rowan 1977, 
DiMaggio & Powell 1983, Tolbert & Zucker 1983). Alternatively, other 
authors have stressed the role of powerful actors who have the ability to alter 
their internal and external environments. Some have located the nexus of 
power in terms of claims made to solve the problems of the organization 
(Pfeffer 1981, Fligstein 1987). Others have argued that resource dependence 
determines who wins the power struggle and who guides the organization 
(Pfeffer & Salancik 1978, Burt 1983). 
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This review is organized in the following way. We first examine various 
economic views and then delineate varieties of sociological theories. Next, 
we attempt to examine what is known about the three important organization 
changes we mentioned above: the creation of vertically integrated firms, the 
spread of diversification strategies (which can be further subdivided into 
movement into production of related and unrelated products), and the disper- 
sion of the multidivisional form. The goal is to consider the adequacy of the 
theories as explanations of the things they claim to explain. Finally, we 
discuss the strengths and weaknesses of the two approaches, with the goal of 
suggesting the direction work on firms might take. 


Economic Approaches 


The history of the economic theories of organizational change we consider 
here, with the notable exceptions of Leibenstein’s X-efficiency theory and 
Nelson & Winter’s evolutionary theory, is one of introducing new elements 
into the basic structure of profit maximization in an attempt to capture the 
phenomena of interest without expanding the basic set of assumptions. The 
preoccupations of industrial organization theory, the subfield of economics 
that first treated systematically the substantive issues we examine, reflect this 
restriction. Under conditions of perfect competition, profit-maximizing is 
synonymous with achieving the highest possible degree of efficiency, given 
the information about scarcity and alternative uses of resources implicit in 
prices for inputs and outputs (Stigler 1968). 

Under less than perfect competition, in the empirical world of oligopoly 
and monopoly, an alternative route to profit-maximizing is available. Firms 
can, at least potentially, use their positions in markets to extract prices higher 
than those required by efficient production, and retain the difference as rents 
(Tullock 1980). The link between inhibited competition and market power 
produced a theory—known as the structure-conduct-performance per- 
spective——linking technologies of production with structures of markets, with 
the goal of distinguishing between actions aimed at increasing efficiency and 
actions aimed at extracting “excess” profits by inhibiting competition (Caves 
1982, Scherer 1980, Stigler 1968, Bain 1956, Baumol et al 1981). The 
features of organizational structure we examine—divisionalization, di- 
versification, and vertical integration—-came under particular scrutiny be- 
cause traditional economic theory had difficulty explaining them in terms of 
efficiency. 

Lacking an explicit theory of internal organization, industrial organization 
theorists focused on economies in production and marketing to explain be- 
tween-industry variation in market structure and change over time. These 
economies are of two types: (a) scale economies, deriving from the cost 
characteristics of production technologies and, in some theories, the benefits 
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of large-scale advertising, and (b) scope economies, deriving from synergies 
in production or marketing for related products (Caves 1982, Stigler 1958). 

Vertical integration is not strictly explicable in either of these terms. In 
fact, scale economies would tend to militate against vertical integration, 1f, 
for example, the efficient scale for production of an input is large relative to 
the size of the market or other components of the production process. Integra- 
tion may be explicable in.terms of scope economies, however, if information 
generated in one stage of production is applicable to others (Arrow 1974). It 
may still be more efficient simply to communicate this information to suppli- 
ers, however. The most common explanation for vertical integration, in- 
terdependencies in production, is actually a variant of the scope argument, but 
it is limited in application to continuous process technologies. It is less useful 
for the most interesting cases, where integration is not technically required, at 
least not in an obvious way. Industrial organization analysts are therefore 
wary of vertical integration in other contexts, suspecting that vertical integra- 
tion represents an attempt to control inputs in order to disrupt the supplies of 
competitors (Stigler 1968). 

An industrial organization perspective on the multidivisional form would 
concentrate on economies, in research and development, production, or 
marketing, associated with the production of related products. If two products 
share a common input or are based on similar technologies, for example, it 
may be possible to lower unit costs by producing both rather than one. The 
focus here is on the reasons why firms might want to produce for related 
markets, not on the administrative structure most appropriate for such produc- 
tion. Industrial organization theory does not constitute an explanation for the 
adoption of the multidivisional form itself, except to say that if it is the most 
efficient way of administering the multiproduct firm, it will be adopted. 

Unrelated product diversification, or the conglomerate, poses the most 
difficult problem for industrial organization theory. By definition, no syner- 
gies in production or marketing exist. An alternative explanation focuses on 
the rate of return in a firm’s main business. If more attractive opportunities 
exist in other markets, firms could diversify in order to improve their financial 
performance (Gort 1962). This explanation, although intuitively appealing, 
requires the assumption that capital markets are functioning imperfectly. 
Under assumptions of perfectly functioning capital markets, such investment 
should take place by individual owners through financial markets rather than 
by firms. This is particularly true for diversification into unrelated businesses, 
where firms could not be expected to bring specialized knowledge or assets 
that would improve the performance of the acquired firms. 

Faced with problems such as these, and other anomalies, some economists 
of industrial organization began to doubt that the profit maximization view of 
the behavior of managers and entrepreneurs was adequate to account for the 


78 FLIGSTEIN & DAUBER 


conduct of firms. Berle & Means (1932) were the first to suggest that the rise 
of professional managers, the dilution of ownership holdings, and the poten- 
tial organizational slack of large corporations meant that firms were no longer 
forced to produce maximal profits. This view did not really develop in the 
mainstream of economics until the postwar era because no one could figure 
out how to incorporate the Berle & Means’s insight into models of industrial 
organization. The work of managerialists produced a view of the modern 
corporation that seemed more consistent with what existed and, at the same 
time, fit with the dominant paradigm of price theory (Penrose 1959, Baumol 
1959, Marris 1964). 7 

Managerialist theory essentially involves the substitution of different be- 
havioral assumptions for the profit-maximizing assumption of standard 
theorizing. The most common move is to substitute growth or sales as the 
object of maximization, perhaps subject to a profit constraint imposed by 
financial markets or the threat of takeover (Marris 1964, Mueller 1986). 
Focusing on the incentives of managers, managerialists argue that managers 
benefit from growth because of its effects on salaries (through increasing the 
number of levels in the firm hierarchy, for example) or on status (the best 
discussion of these issues remains Marris 1964: ch. 2,3). Since all of the 
changes under consideration are related to growth, managerialist arguments 
are applicable to all of them. They are potentially most useful for diversifica- 
tion, however. It is difficult to construct a profit-maximizing rationale for 
diversification, but it fits quite comfortably into a growth-maximization 
argument (see Mueller 1977 for a review of managerialist literature on 
conglomerates). Managerialist arguments also imply that manager-controlled 
firms should display changes for which growth is the major motivating factor 
at a higher rate than do owner-controlled firms. 

The .managerialist approach differs most decidedly from the industrial 
organization approach in how managers’ motivations and goals are viewed. 
The industrial organization response has been that both managers and owners 
are subject to market constraints. Therefore, neither can stray very far from 
profit maximization without being disciplined by the market (Caves 1982). As 
we noted above, however, managers are more likely than owners to undertake 
courses of action that favor growth in sales or assets. For that reason, as 
managers came to control more and more corporations, the strategies and 
structures of those corporations shifted to reflect the new goal of growth 
maximization. This shift at the level of individual firms also altered the 
competitive environment. While the market still decides what is profitable 
and what produces growth, the central motivation of those who control has 
shifted. : 

The most radical break, within the managerialist tradition, with the in- 
dustrial organization approach is Leibenstein’s work on X-efficiency (1976; 
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see Frantz 1988 for a useful survey of theoretical and empirical work on these 
issues). Leibenstein argues that, except in highly competitive markets, firms 
are not forced to be maximally efficient, and hence they produce at higher 
prices and costs than optimal. He grounds this argument in a psychological 
model of action in which maximum effort and search behavior are resisted 
except under extreme conditions: Leibenstein’s work broadly supports 
sociological arguments that invoke the notion of “organizational slack,” but in 
doing so it loses the ability, possessed by most other economic models, to 
predict determinately the content of action from a description of the situation 
facing firms. 

A different sort of break with the industrial organization perspective came 
with the reintroduction of the costs of exchange and of contracting con- 
siderations into the theory of the firm (Coase 1937, Williamson 1975, Arrow 
1974, Alchian & Demsetz 1972, Fama 1980, Jensen & Meckling 1976, North 
1981). The source of these costs is quite varied—information asymmetries, 
bounded rationality, potential opportunism, etc-—but the essential point is that 
the exchanges basic to economic processes are not costless. Firms act to 
minimize these costs (to the extent compatible with the other elements of the 
optimization problem). This gives the observation explanatory force while 
maintaining its essential continuity with neoclassical models of firm behavior. 
It is worth noting that Douglass North’s version (1981) takes seriously the role 
of the state and the current stock of ideas, particularly those concerning 
property rights and monitoring techniques, in shaping the costs of exchange. 
We do not review here the wealth of literature growing out of these lines of 
argument (a useful review can be found in Ricketts 1987). Instead, after a 
brief explication of the fundamental insights, we focus on specific explana- 
tions offered for the varieties of organizational change we are considering. 

The line of work with the greatest applicability to these issues is the 
transactions-cost analysis associated with Oliver Williamson (1975, 1985). 
Williamson begins with the behavioral assumption that actors are boundedly 
rational and opportunistic. The important consequence for transactions is that 
not all eventualities can be predicted without cost and accounted for in the act 
of contracting, and actors cannot be trusted to refrain from. appropriating 
resources for themselves to which they are not entitled. 

The problem of opportunism becomes particularly serious and potentially 
costly when the degree to which a resource can be transferred to another use is 
highly limited. Asset specificity results when resources take forms specific to 
the relationship itself, as with specialized machinery, formal and on-the-job 
training, and nonstandard parts. The potentially large number of suppliers 
available before the inception of the transaction is transformed to a small 
number (perhaps one) through the process of exchange. This produces, at the 
limit, the ability for one party to extract monopoly rents (for a discussion of 
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rent seeking, ,see Tullock 1980). The choice between contracting to the 
market and producing a hierarchy (which is an internalized form of contract- 
ing) is aimed at dividing property rights in the resources under consideration 
in a way that will minimize potential costs associated with appropriations of 
surplus. The solution to this problem, the incorporation of the transaction 
within an organization, allows opportunism to be suppressed through the use 
of authority relations. 

Apart from problems of opportunism, bounded rationality produces diffi- 
culties in contracting through the market when the nature of the task is 
difficult to specify or when short-term monitoring of performance is difficult. 
Organizing transactions within firms eliminates the need for highly specific 
contracts or frequent renegotiation of contracts as new circumstances arise, 
and it allows a range of monitoring over a long period of time, which is 
difficult in market transactions. Incorporating transactions within firms can 
also be an effective way to deal with information asymmetries, when it would 
be difficult to provide information on processes, for example, through the 
market or when a supplier might be reluctant to invest in a situation where he 
or she was uncertain about returns (Silver 1984). 

Vertical integration can be explained in a number of ways using this 
perspective. Most obviously, vertical integration may defeat possibilities for 
opportunism due to small numbers bargaining by bringing administrative 
sanctions to bear on exchanges between different stages of the production 
process. In a related way, vertical integration forward into marketing may 
prevent degrading the value of a brand name by opportunistic or incompetent 
marketing. More subtly, vertical integration may be the cheapest way to 
organize production of a new kind of product, because there is an information 
asymmetry between the entrepreneur and potential suppliers: If this is the 
case, then new industries should exhibit the highest levels of vertical integra- 
tion, with the level of vertical integration declining over time as the worth of 
the innovation is proven and as the technical knowledge involved becomes 
more widely known (Silver 1984). 

Williamson argues that the multidivisional form represents an effort to 
economize on bounded rationality, because managers are unable to handle 
effectively the information required to operate simultaneously in several 
different markets (1975). This is not essentially an argument about transac- 
tions costs, however, since it is not the costs associated with exchanges which 
are at issue but the.costs.associated with management of diverse activities. 
Williamson also mentions opportunism as a motivation for the multidivisional 
form, but this argument is less convincing (1975). 

Diversification, from this perspective, results from the failure of capital 
markets to control investment in the economy effectively. The basic problem 
is that stock exchanges and investment bankers are not in the position to 
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evaluate which products are likely to produce high rates of return. If a firm is 
reconceptualized as a capital market that allocates capital to its various 
businesses, then diversified firms will have the advantage of being able to 
monitor more closely the performance of the individual units in the firm. In 
this way, firms are more efficient at allocating capital, producing profits, and 
selecting winning and losing industries than are the financial markets (Wil- 
liamson 1975). 

In all of the economic theories we have considered, judgment about the 
appropriate response to changed conditions is not problematic. This is true 
even if, as in transactions-cost theory, the change itself results in con- 
sequences that could not be anticipated because of bounded rationality. There 
is thus a curious mismatch between the limited ability of actors to obtain and 
evaluate information about the exchanges in which they are involved, and 
their prodigous ability to devise the optimal response to the problems of 
bounded rationality inherent in those exchanges. 

The final economic theories we consider retain the notion of an optimal 
response to objective conditions but introduce doubt about the ability of 
managers to evaluate their firm’s situation easily and to generate this optimal 
action. Those managers who correctly assess the demands of the environment 
will produce gains for firms. This perspective is most closely associated with 
the seminal work of Alfred Chandler (1962, 1977) but is also implicit in the 
line of analysis commonly termed “contingency theory” (Lawrence & Lorsch 
1967, Thompson 1967). 

The essence of this approach is to credit markets and internal hierarchies 
with presenting managers with critical problems. The ability of managers to 
perceive and solve those problems is the critical variable that determines 
success or failure. Managers who correctly analyze their problems are able to 
produce adaptations that guarantee organizational survival. This is an eco- 
nomic model in that rational actors can take actions that have consequences 
which objectively promote the success of their organizations. The sociologi- 
cal side of the theory concerns the difficulty of actors perceiving the problems 
of their organizations and constructing solutions. An implicit theory of orga- 
nizational change stresses the activity of key actors and their power to 
transform their organizations. 

The inspiration for this perspective comes from the work of Chandler 
(1962, 1977), who argues that managers have been the most important force 
promoting the growth of large-scale industry in the United States. By observ- 
ing the possibilities of markets, and through trial and error approaches to their 
internal organizational problems, managers have provided the visible hand 
that has guided American business in the past 100 years. While for Chandler 
the central causal force has been the market (1962), which has provided 
opportunities for managers to produce and sell products, he focuses on the 
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process, often halting and ill-understood, by which managers reacted to 
successes and failures in the market when structuring their organizations. 

It is for this reason that Chandler resists the view that transaction-costs 
economics constitutes an adequate explanation for change. In an exchange 
with Williamson, Chandler argues that transaction costs cannot account for 
the actions of managers because they are too poorly defined as an analytic 
concept (1981). Instead, Chandler argues that managers are not intent on 
minimizing transaction costs or easing problems of opportunism, but instead 
they are decision-makers confronted by puzzles that indicate problems. His 
view is grounded in the concrete context in which managers find themselves, 
and although Chandler himself might disagree, this view causes analysts to 
focus attention on the perceptual processes of key managers and their ability 
to solve problems. In fact, the usefulness of Chandler’s work from a sociolog- 
ical view is less in its arguments about the economic utility of organizational 
innovations than in its detailing of the difficulties in assuming that utility will 
lead unproblematically to perception and action. 

Nelson & Winter (1982) have developed a more formal approach to the 
issues that Chandler addresses in their evolutionary theory of economic 
change. Organizations are conceived of as sets of routines designed to deliver 
a product reliably. Managers monitor the success of their current routines, 
largely by reference to other firms, and invest in a search for improved 
routines based on their evaluation of the firm’s performance. This search can 
be directed at developing new techniques internally or at mimicking the 
routines of more successful firms. The differential performance of firms, 
presumably attributable to different mixes of routines, thus produces both 
diffusion and innovation. While Nelson & Winter’s model is sociological in 
its assumptions about the difficulties in producing organizational change, it 
retains its connection with economics in its stress on the role of the market in 
selectively rewarding and punishing firms based on performance. 


Sociological Approaches 


Sociologists have been less concerned than have economists with the specific 
structural changes that are the subject of this review. In this section, therefore, 
we sketch sociological approaches to change in profit-making organizations, 
which we then use in the next section to provide alternatives to economic 
interpretations of the data on the determinants of change. Sociological views 
of firms emphasize the looseness of that connection between actions and 
outcomes which economic arguments depend upon. This looseness is of two 
types. First, the connection between firm actions and outcomes is loose in an 
analytic sense. The “discipline of the market,” viewed in neoclassical eco- 
nomic models as certain and instantaneous, is in fact attentuated in time and 
severity. For large firms particularly, a safety net of market power, control 
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over resources, diversification, and political leverage prevents the translation 
of less-than-optimal performance into the firm’s extinction. To use a familiar 
term, sufficient slack exists in organizations that a range of actions is con- 
sistent with survival (Thompson 1967, Leibenstein 1976). 

Second, the connection between actions and outcomes is also loose in a 
subjective sense; managers cannot reliably identify the relations that produce 
the outcomes they observe and hence they cannot generate appropriate re- 
sponses. The complexity of events in large firms makes the assignment of 
causes to events extremely problematic, even impossible. If IBM’s profits go 
down 10% in one quarter, what is to be blamed? The list of possible 
candidates is vast-manufacturing glitches, competitor’s actions, currency 
fluctuations, an overloaded administrative structure, etc. This is, in essence, 
the problem we pointed out with transaction-cost economics. To expect 
managers, faced with the actual manifestation of bounded rationality, to 
generate the optimal response violates the assumption of inadequacy which 
set up the problem in the first place. 

Having dumped managers into a Jamesian “blooming, buzzing confusion,” 
the problem is to explain how managers act at all. After all, the economic 
image of managerial action does have some appeal—managers evaluate 
Situations, consider alternatives, make decisions. Reconciling these two 
points of view requires an understanding of how the worlds in which manag- 
ers manage are created, an understanding which will produce some notions 
about the conditions under which those worlds might change. In this connec- 
tion, in fact, it is interesting to note that some economists who are explicitly 
concerned with the process of change have turned to various sociological 
accounts for their models of action (e.g. Nelson & Winter 1982). 

There are two broad sociological approaches to answering the question 
about how managers act; we term these normative and power-based. The 
normative approach stresses that action is guided by rules that standardize 
behavior and reduce uncertainty. The power perspective emphasizes the 
relative power of actors to construct courses of action and impose them on 
others. Within these two approaches, theories can be distinguished by 
whether they specify the locus of change as inside or outside the organization. 

In the normative approach, action is made possible by organizational 
routines—routinized operating procedures, standard ways of interpreting 
events in the “environment” (and indeed the definition of the environment 
itself), and agreed-upon hierarchies of values. The complexity of the world is 
thus artificially reduced by structures of beliefs and practices. The notion that 
it is possible to devise optimal responses to bounded rationality is rejected. 
Instead, widely held beliefs about the world have the power to organize 
understanding and perception and so make the kind of action which even the 
most radical work in the neoclassical tradition posits unlikely to occur. 
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The most influential of the normative approaches, taken by the Carnegie 
School, focuses on internal processes leading to a distinctive view of organi- 
zational change. If there are no obvious signs of disfunction, organizations 
use routinized procedures to generate unchanging action. In the presence of 
such signs, however, organizations also use routinized procedures, but they 
use them to produce solutions which involve changes in patterns of action. 
The critical point for our purpose is that such changes are conservative: minor 
changes are considered before major changes, and the search stops when a 
change considered likely to work is found (March & Simon 1958, Cyert & 
March 1963). Sharp breaks are seldom observed, and changes which in 
retrospect appear radical are made up of a large number of smaller, in- 
cremental shifts. This description of the search process is in fact consistent 
with Chandler’s case studies of the process leading to the multidivisional form 
(Chandler 1962). 

Theories that focus on external sources for routinized procedures do not 
posit an incremental change process at the level of individual organizations. 
In these models, features of organization structure are not responses to tasks 
imposed by environments. Instead, organizational practices themselves have 
their origins in environments and are adopted by organizations. The mech- 
anisms that can produce such changes vary. DiMaggio & -Powell (1983), for 
example, argue that professionalization can produce common practices 
among a set of organizations because of the common training of organization 
members. Fligstein (1985, 1989a) finds evidence that competitors adopt the 
innovations of leading firms that are perceived as successful. Meyer & Rowan 
(1977) posit a more macrolevel process of progressive rationalization. In this 
view, participation in a common culture—which gives value to organizational 
features seen as “rational”—produces adoption of those features. 

From the point of view represented by normative theories, the problem of 
explaining why firms adopt the innovations that we are considering is a 
problem of explaining why managers in certain firms come to believe that a 
particular way of organizing will improve the performance of their firms. The 
Carnegie School’s answer is, in many ways, close to that of economic theory. 
These innovations arose because persistent problems manifestly could not be 
solved using existing methods, and they were the least disruptive change that 
could ameliorate the problems which provoked the search. To anticipate some 
of the evidence we consider in the following section—the Carnegie School 
can also explain the persistence of some of these practices (such as vertical 
integration in the oil industry) after the period in which they were effective: 
Unless routinized practices result in substantial disruptions, they: won’t be 
changed even if the situation that originally produced them no longer holds. 
Stinchcombe (1965) makes a similar argument. 

The various diffusion theories represent a more fundamental break with the 
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causal factors underlying economic models. The combination of perceptions 
and patterns of interaction is what produces similarity in organizations, and 
changes across organizations—not common problems in production for the 
market. In these views, in fact, the explanations that economists offer for 
innovations may serve simply as rationalizing accounts, serving as jus- 
tifications for action, but not actually motivating it. 

In the most extreme of these views, shared beliefs about the world fill a 
psychological need for certainty, and world views acquire a “taken for 
granted” quality which renders questioning extremely difficult (Zucker 1977). 
Such a view risks leaving no room for innovative behavior, which necessarily 
involves at least some departure from accepted behavior. In slightly different 
terms, such extreme views leave little room for agency in accounts of 
organizational change (DiMaggio 1988). Attention to the process of diffu- 
sion, particularly to the intra-organizational dynamics involved, including 
examinations of failed as well as successful innovations, should produce a 
more balanced picture. 

In the second broad approach, the power-based perspective, attention is 
paid to the implications of power relations for organizational action. If 
intra-organizational power relations are considered, stability is easier to ex- 
plain than change. Changes in organizational strategy and structure are likely 
to have negative consequences for actors who have power under the current 
Structure, and the changes are therefore likely to be resisted by them (Hannan 
& Freeman 1984, Crozier 1964). For instance, Palmer et al (1987) argue that 
owners are less likely than managers to diversify and divisionalize the firms 
they control because they are afraid of losing that control. Alternatively, 
Fligstein (1985, 1989a) shows both that innovations are sponsored by sub- 
units which have the most to gain from their implementation and that success 
can provide the resources necessary to institutionalize changes. 

Extending the argument to the organization’s immediate environment sug- 
gests that the power of organization members is dependent, to some extent at 
least, on their ability to control critical contingencies facing the organization 
(Pfeffer & Salancik 1978, Pfeffer 1981, Perrow 1986). There are a number of 
important sources for this ability: expertise that will aid the organization’s 
success, connections to important resources outside the organization such as 
funding, suppliers, or customers, and structural position within the organiza- 
tion. If critical contingencies change, then the balance of power within the 
organization may change as well. The structural predictions that result from 
this point of view may well differ from transactions-cost analysis more in the 
mechanism than in the outcomes. From a power perspective, it is not calcula- 
tion that aligns the interests and actions of organizations, but the power that is 
gained from making “correct” decisions. 

Perrow (1981) approaches the problem of power in profitmaking organiza- 
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tions from a different perspective. His major criticism of Williamson and 
Chandler is that the efficiency approach ignores the ultimate ends of corporate 
actions. From this point of view, firms are engaged in competitive struggles to 
create oligopolies and monopolies and to disrupt their competition. Hence, 
their actions are oriented to increasing their power vis 4 vis one another and 
not toward increasing internal efficiency. Similarly, the creation of internal 
labor markets and the multidivisional form have nothing to do with the 
minimization of transaction costs or solving of organizational problems due to 
ineffective coordination. Instead, such actions are oriented toward extending 
the control of managers over their lower level managers and workers. 

Unfortunately, it is often difficult to tell the difference between a power 
struggle account of what happens in the firm and an efficiency account. If the 
firm’s managers are trying to increase their profits by undermining their 
competition or controlling their workers, then this can be seen not as an 
attempt to gain “power,” but as an effort to reduce costs in order to survive in 
a market (Williamson & Ouchi 1981). It is useful to remember, however, that 
the range of possible organizational structures is restricted by the distribution 
of power within the system; in economic terms, the initial distribution of 
resources and property rights constrains the future course of exchange. 

Adopting an. interorganizational perspective, DiMaggio & Powell (1983) 
argue that organizations may use power over other organizations, generated 
through the mechanisms that Williamson identifies, for instance, in order to 
force changes in structure. They offer no argument about why organizations 
would use their power in this way, however, and a more plausible line of 
argument would focus on organizational defenses against such dependencies, 
as in the economic arguments about vertical integration. 

More promising approaches retain the emphasis on coercion, but provide 
more realistic mechanisms. For example, the notion of legitimacy can be 
applied to organizational structure. In this view, actors are unwilling to enter 
into exchanges with organizations that do not manifest the “normal” features 
of organizations in a particular industry (DiMaggio 1985). 

The state is also a powerful force in shaping organizations, in several ways. 
In the extreme, the state may directly mandate certain structures, as when it 
deals with social service providers, for example. More subtly, the legal forms 
the state makes available for organizing favor some structures and not others. 
In a more negative vein, legislation banning certain forms of organization— 
for example cartels, monopolies, etc—-exerts a direct influence on structure 
(Fligstein, 1989b). 

The sociological approaches stress the difficulty of shifting organizational 
forms, the role of power in the maintainance and the transformation of forms, 
and the need to construct market processes and define the environment. 
Organizational changes should therefore occur infrequently, spread slowly, 
and persist even if they do not make profit. Their spread will depend on the 
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institutional environment, the internal power struggle, and resource depen- 
dence of given organizations. 


Economic and Sociological Explanations of Changes in 
Corporate Forms 


Our purpose here is to examine the evidence regarding three structural 
changes——shifts toward vertical integration, divisionalization, and diversifica- 
tion. While we have argued that sociological perspectives are applicable to the 
study of these phenomena, the large majority of empirical work on these 
issues has focused on testing hypotheses generated from economic theories. 
This produces several problems in attempting to compare directly the utility of 
the two perspectives. First, the theories are posed at varying levels of general- 
ity and often refer to different processes. Therefore, it is nearly impossible to 
generate a critical test that shows one theory to be right and the other wrong. 
Second, economic theories generally assume that the observed distribution of 
firm strategies and structures represents an equilibrium. The important im- 
plication for our purposes is that the observed structure represents an attempt 
to optimize under present conditions. In investigating the consequences of 
vertical integration; for example, the most common procedure in the eco- 
nomic literature is to examine a sample of firms over a short time and to 
compare the profitability of those that have integrated with those who have 
not. This type of design makes it difficult to evaluate hypotheses about the 
diffusion of processes or their origins, which require data on the performance 
of firms before and after integration and the actions of other firms in the 
industry. 

Given the history of the investigation of these issues, therefore, sociologi- 
cal explanations will unavoidably be offered to account for results that appear 
anomalous according to economic theories; but it will generally be impossible 
to test directly the utility of sociological explanations. We. hope that this 
review will stimulate efforts to make that possible. 

The literature considered first revolves around the adequacy of efficiency 
explanations of vertical integration. The most comprehensive test of this 
thesis comes from Ravenscraft (1983), using the Lines of Business dataset. 
This dataset was developed by the Federal Trade Commission in order to 
examine the relation between structure, conduct, and performance at the 
product level using both firm and industry data, rather than one or the other. It 
contains information on 3674 lines of business operated by 456 corporations 
from 1954-1981. Ravenscraft finds that vertical integration does not signifi- 
cantly affect profits within lines of business. 

Rumelt studied a sample of Fortune-500 firms from 1949-1969 (1974). He 
considers the effects of different corporate strategies, including product re- 
lated, conglomerate, and vertically integrated strategies, on a variety of 
measures of growth and profitability. He concludes that vertical integration is 
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one of the lowest payoff strategies (1974:94). Gort (1962) studied the pro- 
cesses of diversification and integration in the 111 largest firms from 1929— 
1954 (1962). He was unable to show any relation between the degree of 
vertical integration and the profitability of the firm (1962: ch. 5). Fligstein 
(1986) has shown that in the postwar era, members of the 100 largest firms 
with elements of vertical integration have not grown relative to other .firms 
and have actually experienced a statistically significant lower growth rate in 
sales, assets, and profits relative to those other firms. 

Luffman & Reed (1984) studied a sample of the 496 largest British 
corporations from 1970-1980. They used a design similar to Rumelt’s to 
examine the consequences of various strategies on profit outcomes. Their 
results generally supported Rumelt’s. Firms that were diversifying grew faster 
than those that were not. Vertically integrated firms grew the least and were 
the least profitable (1984: ch. 5). One piece of evidence ran counter to 
Rumelt: vertically integrated firms producing one product were highly profit- 
able. This category included only a small number of firms, however. Product- 
dominant firms that were vertically integrated, such as most steel companies, 
performed less well than diversified firms. 

Levin studied the role of vertical integration in the performance of the oil 
industry between 1948 and 1972 (1981). He states: “The primary conclusion, 
that there are neither significant economies or diseconomies to vertical inte- 
gration, is remarkably robust” (1981:226). Similar results were obtained by 
Armour & Teece (1978) in their study of oil company performance between 
1955-1973. i 

Adelman (1965) demonstrated that there were no discernible trends in the 
amount of vertical integration in the American economy from 1849-1939, 
and Laffer (1969) came to similar conclusions for the period 1929-1965. 
Tucker & Wilder (1977) extended these analyses to the years 1954—1972 and 
showed a slight increase in vertical integration. Their most interesting results, 
however, concerned the determinants of the amount of vertical integration in 
firms at the end of the period. They showed that the growth of the industry 
was negatively related to its level of vertical integration, and the industry’s 
concentration was a positive cause of integration (1977:85). 

Taken together, these studies suggest that in the modern era, across two 
highly industrialized societies, vertical integration was not a positive indicator 
of good economic performance for most large corporations. This result 
challenges the strong version of neoclassical theory, which predicts that 
organizational innovations occur because they aid efficiency. The result also 
poses problems for transaction-costs economics, since it is difficult to explain 
why the firms that integrated did so. One possible rejoinder, of course, is that 
the integration represents an error on the part of managers, but then the 
predictive power of the theory. is called into question. 

It is possible to argue that integration was a useful strategy at an earlier 
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stage, and that the observed integration represents the persistence of the 
practice even after its efficiency effects cease (Levin 1981). Indeed, one of 
the central themes of Chandler’s Visible Hand is that managers in the late 
nineteenth and early twentieth centuries took over the functions of markets 
because markets could not provide inputs reliably or absorb outputs of 
efficient plants rapidly. Backward and forward integration of production 
ensured corporations of supplies and customers for plants that required econo- 
mies of scale to operate efficiently. Most of Chandler’s book actually con- 
cerns the role of forward integration into marketing and its role in helping the 
firm survive. The difficulty for economists with using a persistence argument 
is that it calls into question the assumption of equilibrium that makes possible 
the evaluation of an observed state in terms of its presumable optimality. If 
one takes into account the fact that it is not merely the persistence of 
integration in firms but also the adoption of integration by firms which must 
be explained, the problem grows worse. 

A sociological view would stress that managers created vertically inte- 
grated firms for reasons other than efficiency gains (Perrow 1981, Fligstein 
1989b). Instead, vertical integration was one aspect of the attempt to create 
monopolies in the core of American industry at the turn of the century. In fact, 
the thrust of the older tradition in industrial economics does also cast doubt on 
the efficiency claims (Burns 1936, Scherer 1980). This tradition saw vertical 
integration as an attempt to deny one’s competitors access to necessary inputs. 
Vertical integration could also be used to create the ability to flood the 
market, making competitors wary of crossing leading firms. As a result, 
smaller firms had to follow the lead of larger firms and integrate to defend 
themselves. . 

Fligstein (1989b:ch. 3) has elaborated this argument by showing that 
industries with extensive vertical integration were also likely to be quite 
concentrated. Large firms tried, by building up capacity, to attain sufficient 
control over the market, to enable them to set prices effectively for the 
industry. Part of this strategy involved the control of key inputs, such as raw 
materials or transportation systems. In the oil industry, for example, various 
independent producers vertically integrated to prevent Standard Oil from 
monopolizing the production and transportation of oil (Williamson & Daum 
1959, DeChauzeau & Kahn 1959). Similar conditions existed in the steel 
industry following the emergence of US Steel (Hogan 1971). The vertical 
integration that characterized the emergence of oligopoly conditions in both 
industries during the merger movement of the 1920s was defensive in nature. 
This alternative to the efficiency accounts emphasizes the power of large 
firms and the defensive nature of vertical integration. Fligstein (1986) shows 
that the only decade in the past 60 years when vertical integration positively 
affected growth for the largest firms was from 1919-1929, when the creation 
of oligopolies in key industries was accomplished. 
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The next innovation we consider is the spread of the multidivisional form 
(MDF). Those who were among the first in their industry to adopt the MDF 
were able to extract gains from that structure (Fligstein 1985, Rumelt 1974, 
Armour & Teece 1978, Chandler 1962). This supports Chandler’s or Wil- 
liamson’s assertion that the MDF produced a more efficient structure for 
operating the large corporation. There is also evidence that the major cause of 
the MDF is diseconomies that result from firms producing multiple products, 
as Chandler suggested (Donaldson 1986, Fligstein 1985). However, some of 
the studies that have examined whether or not the MDF has an effect on 
performance failed to control for product mix. Since gains to structure could 
be the result of strategy, attributing them to structure is misleading (see 
Armour & Teece 1978). 

The efficiency advantages of the MDF did not lead to its immediate 
adoption, however; this suggests that factors affecting the rate of adoption 
must be taken into account. The spread of the MDF was actually quite slow, 
(Fligstein 1985). It took from 1922 until the mid 1950s before half of the 100 
largest firms were organized as MDF. Rumelt has shown that a great number 
of diversified firms did not adopt the MDF, even when divisionalization 
became the accepted standard for large firms (1974). Palmer et al (1987) 
demonstrate that owner-controlled firms are less likely to be divisionalized 
than manager-controlled firms, net of whether or not the firm was diversified. 

In the postwar era, the business community came to believe that the road to 
success was diversification, divisionalization (or decentralization), and merg- 
ers. This conventional wisdom has dominated the discourse of managers since 
the late 1940s (Fligstein 1989b: ch.7). One of the best predictors of the 
implementation of the MDF is whether or not firms in the industry adopt it, 
particularly in the postwar era. This result could reflect such a process of 
institutionalization (Fligstein 1985). 

Explaining the emergence and popularity of conglomerates poses the great- 
est difficulties for economic theory. It is important to distinguish between 
related-product diversification (which can theoretically produce synergies in 
research, production, or marketing, as we noted above) and unrelated- 
product, or conglomerate, diversification. Generally, most observers have 
reported that diversified firms outperform nondiversified firms (Rumelt 1974, 
Biggadike 1976, Berry 1975, Luffman & Reed 1984, Chandler 1962; see 
Scherer 1980 for a review), although Gort (1962) could show no increase in 
profits or assets among diversifying firms. Almost all of the studies report that 
product-related diversification causes growth, while the’ results concerning 
product-unrelated diversification are mixed. 

The primary explanation from the older industrial economics for con- 
glomerate formation is that firms exit industries where growth was slow and 
enter industries where growth is higher. Gort (1962) presents clear evidence 
that this is so. Rumelt presents similar evidence (1974). The original acquisi- 


STRUCTURAL CHANGE IN CORPORATIONS 9] 


tive conglomerates (Textron, Litton, ITT, LTV) tended to leave declining 
industries and to enter expanding ones (US Federal Trade Commission, 
1972). The difficulty with this explanation on theoretical grounds is that it is 
unclear why investors would provide capital for firms to engage in this kind of 
portfolio management when they could invest their funds directly or indirectly 
in firms through the financial markets. Recognizing this problem, the transac- 
tion-cost perspective points to a market failure in the capital markets, making 
it more efficient for firms to perform the investment function. Essentially, 
firms can monitor the performance of their companies more effectively than 
can the financial markets can. 

Unfortunately, the transaction-cost perspective is not supported empirical- 
ly. Conglomerates almost always earned less than their product-related and 
-dominant counterparts (Lynch 1971, Biggadike 1976, Berry 1975; see Muel- 
ler 1977 for a review). Further, conglomerates came into existence at pre- 
cisely the historical moment, the postwar era, when capital markets expanded 
greatly and appeared to be operating at their most efficient. 

There is more evidence for the managerialist thesis regarding con- 
glomerates—that managers expanded their firms through mergers because 
they valued growth. Scherer (1980) provides a review which suggests that 
manager-controlled firms were more likely to increase sales and assets and 
less likely to produce profits than were owner-controlled firms. Con- 
glomerates, under the control of a small number of key managers, all ex- 
perienced much higher growth in sales and assets than they did in profits (US 
Federal Trade Commission 1972, Mueller 1977, Rumelt 1974). Fligstein 
(1986) provides evidence that conglomerate strategies produced spectacular 
changes in the sales and assets of large corporations but did little to promote 
profits. In a review of the evidence of the efficency aspects of mergers, 
Mueller concludes: “The empirical literature upon which this survey focuses, 
draws a surprisingly consistent picture. Whatever the stated or unstated goals 
of managers are, the mergers they have consummated have on average not 
generated extra profits for the acquiring firms, have not resulted in increased 
economic efficiency” (1986:217). 

The managerialist thesis provides an explanation for the findings regarding 
the low performance of conglomerates on measures of profits, but it does not 
explain why unrelated product diversification was so popular at the moment 
that it was, and why some firms engaged in it and others did not. paçiotogical 
insights can provide some partial answers to these questions. 

The federal government inadvertently encouraged product-related and un- 
related mergers by outlawing vertical and horizontal mergers after World War 
II (Stigler 1966, Adelman 1961, Fligstein 1989b: ch. 6). These actions and 
the general threat of antitrust litigation left the managers of the largest firms 
with few options to achieve quick growth besides diversification through 
mergers. 
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Fligstein has shown (1989b) that managers who originated in sales and 
marketing and finance positions were more likely to shift their organization to 
product-related and -unrelated strategies. The firm as portfolio, from this 
perspective, is a cultural construct of the 1950s and 1960s and of managers 
who emphasized the short-run financial performance of each product line. 
The conglomerates and their followers produced spectacular growth and 
entered the lists of the largest corporations in the country. This explosive 
growth appeared to other managers in the organizational fields of the largest 
firms as clear evidence of the superiority of those tactics. They joined the 
merger bandwagon. 

Perhaps the most striking conclusion that can be drawn from this review is 
that economic accounts of organizational change, while internally consistent 
and on the surface plausible, often fail when confronted with data. Sociolog- 
ical views, while they seem to be more consistent with the data, are not well 
enough developed at this point to yield more than plausible post hoc accounts. 


Conclusion 


In this conclusion we draw on our review of the economic and sociological 
literature on change in corporate organizations to suggest avenues for future 
research, both theoretical and empirical. We concentrate on extending the 
insights of sociological perspectives and in particular on the development of 
theory with predictive value. While our review has focused substantively on 
changes in organizational structure, our comments are necessarily directed to 
a broader conception of organizational change. 

An issue that points up sharply the differences between economic and 
sociological logic is the grouping of firms for purposes of understanding 
commonalities in structural changes. For economists, the notion of a “market” 
is the most useful grouping strategy. A market can be thought of as a set of 
products that can be substituted for each other (in more technical terms, 
markets are composed of products that have high cross-elasticities of de- 
mand). The firms that produce those products are in the same market. 
Changes in the situation facing those firms, such as technological innovation, 
changes in prices for inputs, etc, should affect all of the firms in a market in 
predictable ways. It follows, incidentally, that firms should not be affected by 
changes that do not affect their markets. 

Implicit in the sociological ideas we have presented is the argument that the 
grouping represented by the idea of a “market” is not sufficient to delimit the 
firms that should be considered together in explaining (and detecting) com- 
monalities in structural changes. Sociological theories that focus on the 
socially constructed nature of the worlds in which managers operate cannot 
unproblematically accept groupings as based on objective criteria that fail to 
ask whether managers within firms perceive and act based on these criteria. 
Given that similarities in perception and practices produce similarity in 
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behavior in these conceptions, it is necessary to determine empirically which 
firms are similar in their practices and perceptions. If the mechanism posited 
is one in which managers examine other firms they consider to be similar to 
their own for clues about innovations, etc, then the relevant grouping is one 
that corresponds to the grouping used by managers in the relevant firms. 

The notion of an “organizational field” has been used to describe these 
groupings (DiMaggio & Powell 1983, DiMaggio 1985, 1988; Scott & Meyer 
1983, Fligstein 1989b). This concept can usefully be seen as analogous to the 
earlier one of a “reference group,” in that the key issue in assessing action is 
determining the group of others who are looked to for comparison and 
modelling. Several difficulties arise in using this concept, however. For 
example, the set of firms that can be grouped together may change based on 
the kind of action under consideration. For example, a firm that manufactures 
tin cans may look to other tin can manufacturers when developing man- 
ufacturing procèsses, to other container manufacturers on marketing issues, to 
other medium-size companies on issues around organization and administra- 
tion, and to local leading companies when deciding about philanthropy. 
Further, to the extent that the grouping relies on the perceptions of managers, 
these perceptions may not be symmetric, producing organizational fields 
specific to particular organizations. 

These issues, while complex, yield a wealth of theoretical and empirical 
subjects for investigation. Beyond the obvious issues of establishing the 
relevant organizational fields, there are problems of delineating the dynamics 
of these fields. These dynamics have two distinct aspects: changes in the 
organizations within a field, and changes in the composition of fields. 

There are several potential sources of innovative behavior within a field: 
organizations facing problems that seem to demand solutions outside the 
repertoire of normal practice, organizations entering fields and producing 
behavior consequential for existing organizations, exogenous “shocks” which 
make the assumptions underlying the action of all the firms in a field 
untenable, and state action. For innovations to become widespread, however, 
other firms must adopt them. Under what conditions will such an adoption 
take place? In the language of institutionalization theory, the problem is 
determining when innovations will achieve legitimacy and diffuse through a 
field. Answering this question requires examination of failed as well as 
successful innovations, a research area which (for obvious reasons) has been 
neglected. It also requires a way of thinking about the state of a field as a 
whole: how stable is the membership of the field, how vulnerable is the field 
to new entries, how do organizational fields respond to environmental 
shocks? i 

The second aspect of the dynamics of organizational fields involves 
membership. If fields describe managerial perceptions about similarity on 
certain dimensions, how are those perceptions formed, and how might they 
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change? The diffusion of structural innovations in this century has clearly 
been accompanied with a progressive redefinition of the nature of firms, from 
identification based on products and markets to identification based on size 
and financial strategies. It is important to investigate how such shifts take 
place. 

Our stress on the importance of managerial perceptions of the nature of the 
worlds in which their firms operate, coupled with our earlier discussion of the 
role of broadly held beliefs in producing similarities in the actions of firms, 
points to the need to join models of internal decisionmaking with theories 
about diffusion and institutionalization. While the Carnegie school theorists 
generally neglected the role of actors outside the organization in influencing 
changes within the organization, there is no necessary reason why the model 
of incremental change cannot be used to predict the conditions under which 
solutions generated from outside the organization will be adopted. Coupled 
with an understanding of the ways in which powerful internal actors can use 
events outside the organization to advance policies beneficial to themselves, 
such a model would constitute an important advance in understanding the 
dynamics of organizational change. 

The program which we have presented is an ambitious one, but it is 
important at this stage to develop a coherent sociological alternative to 
economic approaches to profitmaking firms. The point that firms and markets 
are social phenomena is well established. Our review of economic explana- 
tions for change in organizational structure reveals that there is ample room 
for sociological accounts. The challenge now is to turn this insight into more 
rigorous theoretical and empirical work. 
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Abstract 


This review article focuses on the three control mechanisins that govern 
economic transactions between actors: price, authority, and trust. In contrast 
to conventional approaches that view market and hierarchy as mutually 
exclusive control mechanisms (or as poles of a continuum), we argue that 
price, authority, and trust are independent and can be combined in a variety of 
ways. For instance, price and authority are often played off each other within 
firms, while trust and price are sometimes intertwined to control transactions 
between firms. We also identify a type of organization largely ignored in the 
literature: the plural form. In the plural form, organizations simultaneously 
operate distinct, control mechanisms for the same function. For example, 
organizations operate franchises and company-owned units under the same 
trademark, and companies sometimes make and buy the same part. To 
understand this form, the analytic focus must move from individual transac- 
tions to the broader architecture of control mechanisms. 


INTRODUCTION 


A growing body of recent research has documented the variety of ways 
industry is organized. The sharp delineation of markets and hierarchies 
offered by Coase (1937) has gradually given way to the widely accepted 
recognition that myriad organizational forms exist along with markets and 
hierarchies. In particular, much research has pointed out the existence of 
stable long-term relationships between independent exchange partners. Coop- 
erative arrangements (Richardson 1972), relational contracting (Macneil 
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1978, Goldberg 1980), joint ventures (Mariti & Smiley 1983), quasifirms 
(Eccles 1981), global coalitions (Porter. & Fuller 1986), and dynamic net- 
works (Miles & Snow 1986) are but a few of the names these complex forms 
go by. 

Explaining the existence of these assorted forms usually involves catalog- 
ing the deficiencies (or failures) of fully integrated hierarchies and/or com- 
petitive markets and then arguing that the new forms solve, or at least 
mitigate, those problems. A thread weaving its way through the discussions of 
these complex organizational forms is that they fall between markets and 
hierarchies, defined as ideal types. However, the presumption that a con- 
tinuum runs from market to hierarchy—-with relationships between tech- 
nologically separable units arraying themselves along it—is misleading, as 
are the three-fold typologies which simply add a category to the market and 
hierarchy dichotomy (Ouchi 1980, Williamson 1985). These approaches rest 
on the premise that market and hierarchy are mutually exclusive means to 
govern transactions. 

There are two major problems with this premise. First, elements of the 
ideal types (or the poles of the continuum) are often found mixed together 
empirically: for instance, features of markets and hierarchies are often com- 
bined (Eccles 1985, Stinchcombe 1985). Second, markets and hierarchies and 
“combinations” are often embedded in or lead to the formation of collateral 
social structures (Macauley 1963, Macneil 1978, Granovetter 1985). The 
trust produced by these social structures does not simply replace market and 
hierarchy; frequently it complements the two forms. While transactions are 
rarely governed solely by market, hierarchy, or trust, these mechanisms do 
serve as the building blocks for the complex social structures so common in 
organizational life. 

This review article is organized around the three control mechanisms .that 
govern economic transactions between actors: price, authority, and trust. In 
contrast to more conventional approaches to the organization of industry, 
however, this review emphasizes how these control mechanisms are com- 
bined in empirical situations. The next section reviews the transaction cost 
economic approach, which represents the most ambitious attempt to explain 
to what extent companies may choose to integrate their operations. Although 
it is wedded to the view that markets and hierarchies are at opposite ends of a 
continuum, transaction cost economics raises many of the central issues that 
face actors and observers of organizations. The following section explores the 
relationship. between price and authority. Both inside and outside the firm we 
find the concurrent operation of these two mechanisms. The section on Trust 
considers trust as a control mechanism. We argue that trust often emanates 
from social norms and personal relationships, and we examine how these 
factors affect economic exchanges. Trust can also be found combined with 
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price and authority to govern the economic relations between actors, and 
those cases are considered in the section on Trust, Price, and Authority. 

The final section of the paper steps back from the explanation of the control 
mechanisms impinging on single transactions and argues that in some cases 
the overall structure of transactions can affect the management of each 
transaction. Specifically, we examine Plural Forms, where distinct and dif- 
ferent control mechanisms in the same organizational structure are operated 
simultaneously by a company to perform the same function. The dynamics at 
work in plural forms, such as organizations with both franchises and com- 
pany-owned units, cannot be fully understood on the basis of analyzing single 
transactions. 


MARKETS AND HIERARCHIES 


Markets and hierarchies have long been viewed as alternative mechanisms for 
allocating resources. Coase’s (1937) classic article about the origins of mar- 
kets and hierarchies stands at the center of a research tradition that. continues 
to flourish. Coase (1937:392) succinctly summarized his ‘argument: “The 
operation of a market costs something and by forming an organisation and 
allowing some authority (an “entrepreneur’) to direct the resources, certain 
marketing costs are saved.” The marketing costs to which he referred included 
the costs associated with determining relevant prices and negotiating and 
concluding contracts. 

Coase’s insight has been extended and refined by many scholars, but the 
basic argument remains the same: transactions. will. be governed by the 
institutional arrangement that is most.efficient. Williamson (1975, 1985) has 
led the development of this stream of thought—which has come to be known 
as the “transaction cost economics approach”——-and his work most fully 
elaborates on Coase’s idea. Williamson (1985) identifies three dimensions of 
transactions which he and other scholars contend dictate whether markets or 
hierarchies are most efficient: uncertainty, asset specificity, and frequency. 
These dimensions affect the costs associated with writing, executing, and 
enforcing contracts; when such costs are high, markets fail and hierarchies 
emerge in their place. 

The logic underlying how the three dimensions determine the institutional 
structure is straightforward. [Williamson (1985) explicates each dimension in 
great detail.]. Hierarchy replaces markets when: (a) high uncertainty about 
individual performance in groups (Alchian & Demsetz 1972) and/or high 

uncertainty about future contingencies exists, both of which complicate writ- 
ing contracts; (b) highly specific assets are involved in the transaction (i.e. 
nonredeployable assets) which may lead to post-contractual opportunistic 
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bargaining (Klein et al 1978); and (c) frequent (and therefore costly) 
recontracting occurs. In these instances, it is argued, the installation of an 
authority relation (hierarchy) between the contracting parties is less costly 
than an arms-length market transaction. Masten (1984) and Monteverde & 
Teece (1982a) find empirical support for these propositions in their studies of 
make-or-buy (hierarchy-or-market) decisions about component parts in the 
aerospace and automotive industries. ! 

The make-or-buy studies exemplify a core assumption of the transaction 
cost approach: that markets and hierarchies are mutually exclusive means of 
allocating resources. That is not to say markets and hierarchies are the only 
ways to transact business. A growing body of work documents the existence 
of nonmarket and nonhierarchical organizational forms—forms typically said 
to reside between markets and hierarchies (Richardson 1972, Eccles, 1981, 
Monteverde & Teece 1982b, Mariti & Smiley 1983, Williamson 1985, 
Powell 1987). This stream of work highlights the fact that relations between 
independent exchange partners can be stabilized through formal (e.g. written 
contracts, “hostage exchanges”) and informal mechanisms (e.g. trust). These 
arrangements are said to enable firms to: gain access to know-how unavailable 
in-house; spread the risks associated with uncertain ventures; benefit from 
economies of scale; enter new product and geographic markets rapidly; 
manage interorganizational dependencies; and respond quickly and flexibly to 
changing circumstances (Pfeffer & Nowak 1976, Porter & Fuller 1986, Miles 
& Snow 1986, Powell 1987, Johnston & Lawrence 1988, Ouchi & Bolton 
1988). 

Williamson (1981, 1985) attempts to integrate these empirical observations 
and explanations into the transaction cost approach by arguing that these 
organizational arrangements are a response to efficiency considerations. He 
operationalizes this argument by dimensionalizing the transaction cost vari- 
ables: intermediate levels of uncertainty or asset specificity lead to in- 
termediate forms such as quasi-vertical integration (Monteverde & Teece 
1982b). Williamson (1985) tags the middle ground with the name “relational 
contracting,” representing cases where a contractual market relationship 1s 
stable and enduring. Williamson argues that the opportunism which might 
compromise transactions of the relational type, particularly when transaction- 
specific assets are involved, is often mitigated by ex ante incentive arrange- 
ments such as hostage exchanges (see Williamson 1985, chs 7 and 8). 

Williamson’s amendment to the transaction-cost framework is a stopgap 


‘The Monteverde & Teece (1982a) study foreshadows a point addressed in the section on 
Plural Forms. They dichotomize their dependent variable: If over 80% of the volume of a part is 
manufactured in-house,, then the part is coded as being vertically integrated. Such a coding ` 
scheme, by slavishly adhering to the markets and hierarchies framework, ignores the obvious and 
fascinating issue of why companies so often both make and buy. 
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measure which does little to contain a growing number of empirical anoma- 
lies.? It is becoming clear that market, hierarchy, and relational contracting 
are not mutually exclusive control mechanisms. For instance, relational con- 
tracts are frequently laced with elements of hierarchy (Corey 1978, Stinch- 
combe 1985). Moreover, markets exhibit traits of hierarchies, and hierarchies 
display properties of markets (Eccles 1985, Stinchcombe 1985, Eccles & 
White 1988). These studies and others reviewed in the next three sections 
suggest that the control mechanisms, price, authority, and trust—which map 
roughly on to market, hierarchy, and relational contracting-——are useful con- 
cepts provided we recognize that they are independent and can be combined 
with each other in a variety of ways. 


PRICE AND AUTHORITY 


This section explores how price and authority are combined in intrafirm and 
interfirm transactions. Price mechanisms are sometimes built into hierarchies 
of authority, and authority mechanisms sometimes bind independent ex- 
change partners in a market. To understand these arrangements fully one must 
recognize the interplay between the two mechanisms. 


Price and Authority in the Firm 


Nowhere is the explicit mixing of price and authority more visible than in the 
modern multidivisional firm. With its profit-centers and transfer-pricing 
schemes, the modern firm is widely recognized as introducing features of 
markets into hierarchies (Chandler 1962, Vancil 1978, Anthony & Dearden 
1980). But a firm’s transfer-pricing policies represent more than a simple 
attempt to create a market in the firm. Eccles (1985) found that transfer- 
pricing policies ranged from mandating internal exchanges on a cost basis to 
allowing profit centers full exchange autonomy with either inside or outside 
suppliers. In the former case, authority reigns; in the latter instance, price 
predominates. Even at the extremes, however, the shadow of the other control 
mechanism is visible. 

Eccles & White (1988) dissect the exchange autonomy policy and find that 
managers have a bias against using internal suppliers. They argue that this 
bias emanates from the risks associated with transacting with someone in the 
same hierarchy of authority. In a transaction between two profit centers, if a 
delivery is late or quality is poor, what had been a market relation can, and 
managers fear will, move through hierarchical channels to a higher authority. 
“In the process, the profit center managers are subject to closer scrutiny, and 


*For more extensive critiques of the transaction cost. approach, see Perrow (1986) and 
Oberschall & Leifer (1986). 
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they may lose some of their autonomy or even endanger their career pros- 
pects” (Eccles & White 1988:8546). 

Mandated internal transfers display the same tension between price and 
authority. Mandated transactions are usually made at cost or at the prevailing 
market price. Eccles & White provide several examples of the acrimony 
between division managers precipitated by disagreements over what the full 
costs of a product are and what a true market price is. The conflict is 
exacerbated by managerial reward systems that hinge on divisional per- 
formance; one manager’s victory is another manager’s loss. Nevertheless, - 
mandated transfers are common despite the widely held perception among 
managers that the transaction costs associated with internal transactions ex- 
ceed those of external transactions. This paradox can be understood if one 
recognizes how the conflicts over prices and costs play out under the watchful 
eyes of hierarchical authorities: “Conflict makes information available to top 
management that otherwise might not be known or would be difficult or 
expensive to obtain” (Eccles & White 1988 : $30). 

Transfer-pricing policies provide a crystal clear example of how price and 
authority can be intertwined. In these examples, viewing a transaction as 
being governed by price or authority muddles the subtlety and complexity of 
organizational life. In particular, it obfuscates the multifaceted ways ex- 
ecutives use mixtures of these control mechanisms to manage their op- 
erations. 


Price and Authority in the Market 


Similar forces are often at work in the market. Stinchcombe (1985), Ouchi & 
Bolton (1988), Pfeffer & Nowak (1976) and Abolafia (1984) offer insights 
into how price and authority are combined in markets. Their work responds to 
the occasional admonishment by social scientists that we must not let the 
theory (and myth) of the self-governing, price-controlled market obscure our 
understanding of how markets actually function as concrete social structures 
(Barber 1977, Coleman 1984, Baker 1984, Hirschman 1986, Leifer & White 
1987). 

Stinchcombe (1985) offers a variety of examples—from weapons procure- 
ment to North Sea oil refining to franchise relationships in the automobile 
industry—which illustrate the simultaneous use of authority and price in 
market transactions. He documents several cases where market exchanges are 
plagued with many of the transactional liabilities highlighted in the section on 
Markets and Hierarchies: ex ante product specifications do not exist; design 
and production costs are uncertain; and performance measurement is difficult. 
Yet he finds contracts between independent agents under these conditions. 
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“A structure with legitimate authority, with a manipulable incentive system, with a method 
for adjusting costs, quantities, and prices, with a structure for dispute resolution, and with a 
set of standard operating procedures, looks very much like a hierarchy, very little like a 
competitive market. Yet all these features of hierarchy are routinely obtained by contracts 
between firms in some sector of the economy” (1985: 126). 


Authority mechanisms are written into contracts and also exist implicitly by 
virtue of industry practice. Quality control systems, inspection arrangements, 
and agreements about who has the authority to modify contractual provisions 
are just a few ways authority impinges on the market. Corey (1978) finds the 
same forces at work in his detailed study of the procurement process in 
industrial firms. 

Ouchi & Bolton (1988) report that similar mechanisms govern joint re- 
search and development efforts in the field of high technology. In this area, 
the task is often uncertain and complex, and hence contracts are difficult to 
write and monitor. Nonetheless, interorganizational agreements are a com- 
mon way to develop new technologies (Mariti & Smiley 1983, Friar & 
Horwitch 1985). Ouchi & Bolton’s case studies indicate that the problems 
associated with the inherent uncertainty and complexity of technological 
research and development are remedied by “the close and intimate supervision 
of a commonly agreed upon hierarchy” (Ouchi & Bolton 1988:14). The 
creation of an independent entity controlled by-a separate hierarchy es- 
tablishes an authority structure (board of directors, specified rules for 
participation, etc) which binds together the independent exchange partners. 
At the same time, the price mechanism is involved, since the independent 
entity often owns the property produced by the joint effort, and price may be 
used to allocate that property to the participants. 

Pfeffer & Nowak (1976) argue that the use of joint ventures is not confined 
to organizations attempting to develop new technologies or to undertake new 
activities. Joint ventures are also used to manage interorganizational in- 
terdependence. Pfeffer & Nowak studied 166 joint ventures among man- 
ufacturing organizations and participants in the- oil and gas extraction in- 
dustry. They found that firms in interdependent businesses were significantly 
more likely to form joint ventures than were firms in independent businesses. 
The authors argue that by introducing elements of authority into the market 
by creating a joint venture—companies can ensure continuity with their 
sources of supply or their customers (symbiotic interdependence), reduce the 
vigor of price competition (competitive interdependence), or both. Joint 
ventures are just one of several strategies which can be used by companies to 
manage their dependence on uncertain environments [see Scott (1987: 185~ 
94) for a useful review of these “bridging” mechanisms]. 

The work of Stinchcombe, Corey, Ouchi & Bolton, and Pfeffer & 
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Nowak all indicate that price and authority are woven together to control 
transactions in a market. Abolafia’s study (1984) of the formal and informal 
mechanisms that govern futures trading on the Chicago Board of Trade 
provides another angle from which to view the role of price and authority. The 
actual market where futures contracts are traded—the pit—is governed by 
price.? Abolafia (1984:132) points out, though, that “At the same time 
traders are overloaded and immersed in a continuous conflict with other 
traders, they are mutually dependent . . . to see that the market is maintained 
into the future so that each may continue to profit.” This concern with the 
viability of the market manifests itself in informal norms, as well as in a 
formal committee structure that monitors the behavior of traders and the 
activities of the market. The administrative structure has at its disposal several 
avenues of corrective action. In essence, an authority structure ensures that 
the price-driven market is maintained. 


TRUST 


Price and authority are specialized control mechanisms created for and 
attached to each transaction. There exists, however, a class of more general 
control mechanisms to which we assign the label trust. Gambetta, the editor 
of a recent collection of essays exploring the concept of trust, delineates what 
is meant when we say we trust someone: “[W]e implicitly mean that the 
probability that he will perform an action that is beneficial or at least not 
detrimental to us is high enough for us to consider engaging in some form of 
cooperation with him” (Gambetta 1988:217). Trust is a type of expectation 
that alleviates the fear that one’s exchange partner will act opportunistically. 
Of course, the risk of opportunism must be present for trust to operate. Lewis 
& Weigert (1985) contend that the trust is characterized by a cognitive “leap” 
beyond the expectations that reason and experience alone would warrant: 
where opportunism might be rationally expected, trust prevails. (It should be 
noted that hostage exchanges avoid trust by structuring the transactional 
context in such a way that opportunism becomes irrational.) 

Arrow (1974: 23) summarizes the obvious advantages of trust as a control 
mechanism: “Trust is an important lubricant of a social system. It is extremely 
efficient; it saves people a lot of trouble to have a fair degree of reliance on 
other people’s word.” Obtaining the advantages offered by trust, however, is 
not available to everyone. In a provocative discussion of how modes of trust 


3Baker’s (1984) elegant study of two commodity markets suggests that, even in the pit, price 
is not a simple and precise contro! mechanism. He finds that price is affected by the number of 
players in the market, with larger markets exhibiting greater price volatility than smaller markets. 
This is just one more example of the danger of assuming that the market naturally generates 
prices. 


PRICE, AUTHORITY AND TRUST 105 


were transformed around the beginning of this century, Zucker (1986) out- 
lines three different ways trust—‘“a set of expectations shared by all those in 
an exchange” (Zucker 1986:54)—is produced. Process-based trust emerges 
from recurrent transactions; characteristic-based trust rests on social sim- 
ilarity; and institution-based trust is tied to formal social structures. Zucker 
argues that immigration, internal migration, and business instability disrupted 
the first two modes of trust and led to institutions that produced trust, such as 
professions, bureaucracies, and financial intermediaries [see Shapiro (1987) 
for an interesting discussion of these formal institutions]. 

The discussion of trust in this section is narrower than Zucker’s and focuses 
on how trust arises out of the social context of transactions. We examine how 
(a) diffuse social norms of obligation and cooperation, and (b) personal 
relationships that overlap with economic exchanges provide the basis for 
trust. For more detailed discussions of trust as a sociological concept, we refer 
the reader to Lewis & Weigert (1985) who sketch how the cognitive, emotion- 
al, and behavioral elements of trust combine to form a unitary social ex- 
perience; the collection of essays in Gambetta (1988) provide a variety of 
empirical and theoretical contributions on this topic, too. 


Norms of Obligation and Cooperation 


Experiments in social psychology open the door to questions about the 
assumption of opportunistic actors. Kahneman et al (1986, 1987) report on 
experiments and survey research that illustrate the powerful effect norms of 
fairness have on economic exchanges. The perceived fairness of price and 
wage adjustments, for example, was found to be contingent on what respon- 
dents understood to be the cause of the adjustments. For instance, raising 
prices in response to increased demand was perceived to be unfair; in contrast, 
maintaining prices when costs declined was viewed as being fair. In the next 
section we discuss one way perceptions of fairness might affect exchange 
relationships. In another experiment conducted by Marwell & Ames (1979), 
subjects were presented with the choice of investing in a collective good or 
free-riding; their decisions did not conform to conventional ideas about 
self-interested, opportunistic actors. Despite theoretical assumptions to the 
contrary, these experiments. are suggestive of the norms that might guide 
behavior in some business settings, and when such norms are operative the 
risk of opportunism is: reduced. ` 

Macauley’s classic article (1963) on how and when contracts are used in 
business affairs stands at the center of this line of argument. Macauley found 
that “businessmen often prefer to rely on ‘a man’s word’ in a brief letter, a 
handshake, or. “common honesty and decency’—even where the transaction 
involves exposure to serious risks” (1963:58). And even when contracts were 
used, disputes were rarely litigated. As one businessman said, “You can settle 
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any dispute if you keep the lawyers and accountants out of it” (1963 :61). The - 
norms Macauley illuminated—keeping commitments and standing behind 
one’s products—impose obligations on parties to transactions, and those 
obligations are ignored at the peril of damaging one’s business reputation or 
personal friendships, or both. 

In line with Macauley’s findings, Macneil (1978) outlines a theory of 
relational contracting predicated on the idea that the maintenance of the 
relation is paramount. Macneil agrees with transaction cost economists that 
idiosyncratic investment may lead to a relationship, but he differs with them 
when he explicates how such relations are governed. He argues that two 
norms are particularly applicable to relational contracts: “(1) harmonizing 
conflict within the internal matrix of the relation . . . and (2) preservation of 
the relation” (1978:895). The first norm implies that the relation as it has 
developed—with its shared and often implicit understandings—serves as the 
reference point for resolving disputes. Macneil says, “As contractual relations 
expand, those relations take on more and more the characteristics of 
minisocieties and ministates” (1978 : 898). At the same time, internal resolu- 
tion of the conflict takes place against a backdrop of broader social norms and 
the desire of the parties to continue the relationship for both economic and 
social reasons. [See Whitford (1985) for an admiring review of Macneil’s 
work. ] 

Dore (1983) and Sabel & Zeitlin (1985) report on powerful norms of 
obligation which stabilize economic relationships in two other settings. Dore 
describes how the weaving segment of the textile industry in Japan changed 
from an order dominated by large, vertically integrated mills to a system 
where the production process is fragmented and coordinated among several 
independent family businesses. He attributes the success of this system to the 
presence of “goodwill” in Japanese trading relationships. Goodwill is defined 
as the “sentiments of friendship and the sense of diffuse personal obligation 
which accrue between individuals engaged in a recurring contractual eco- 
nomic exchange” (1983: 460). 

In Japan, trust is engendered by the social norm which insists that business 
relations are personal relations, not by overlapping personal and business ties 
(the topic of the next section.) Dore identifies a collection of characteristics 
that work to maintain this norm: the treatment of trading relationships as 
personal relationships; a recognized personal obligation to maintain the asso- 
ciation; a disdain for adversarial relationships; and a long-term outlook. The 
social norm is further bolstered by extensive financial cross-holdings among 
firms (Gerlach 1987). Gerlach (1987 : 133) contends that these cross-holdings 
serve to “create a structure of stable, mutual: relationships among trading 
partners.” 

In Sabel & Zeitlin’s detailed argument (1985) about the alternatives to 
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modern mass production, they also place a heavy emphasis on the ethos of the 
industrial community. They report that around the turn of the century (pri- 
marily in Europe) a variety of industries—among them silks, specialty steels, 
ribbons, and textiles—were either “confederations ‘of small shops, each 
specializing in a phase of production” (1985: 148) or “groupings of artisans’ 
shops under one roof” (1985: 150). The vitality of-these industries depended 
on a constant reshuffling of resources to meet ever-changing consumer de- 
mand. Sabel & Zeitlin point to the crucial role political institutions played in 
this process, but they also emphasize that “the process of socialization created 
a community across and within generations that protected the economy as a 
whole against the consequences of short-term calculations of advantage” 
(1985: 154). . 

The trust highlighted by Dore and Sabel & Zeitlin is exemplified by the fact 
that in both examples the authors report trading imbalances often existed 
between exchange partners. Accounts did not have to be “settled” each year, 
since there was a shared belief that in the long-run the rewards would be 
distributed fairly. Moreover, there was a sense that this year’s winner could 
be next year’s loser, so to press one’s advantage opportunistically would be 
ill-advised. Eccles & Crane (1988) report on a similar pattern of exchange 
imbalances between customers and banks in their study of the investment 
banking industry. Anthropologists studying gift exchange argue that the 
existence of imbalances (the time between reciprocation of a gift) requires 
trust (Mauss 1969 [1925]). l 

This discussion of trust has focused on interfirm relationships, but trust also 
plays an-important role in intrafirm interactions. Ouchi (1980), working out of 
the transaction-cost framework, argues that “clans” are sometimes the most 
efficient organizational form. Clans replace bureaucracy, he says, “when the 
ambiguity of performance evaluation becomes significantly greater than that 
which brings about market failure” (1980: 134). Through common beliefs and 
values, the clan creates goal congruence among its members and so reduces 
the risk of free-riding (the assumed nemesis of team production.) The work of 
Macauley, Dore, and Sabel & Zeitlin extends Ouchi’s argument by suggest- 
ing that clan-like relations can also occur across organizational boundaries. 

The norms of obligation and cooperation highlighted in these studies all 
imply the existence of some sort of community of shared values, whether they 
reside in a network of business relations (Macauley 1963), a country (Dore 
1983), a geographically bounded industrial district (Sabel & Zeitlin 1985), or 
a firm (Ouchi 1980). Zucker’s discussion of characteristic-based trust (1986) 
rests on this notion explicitly, with social similarity serving as an indicator of 
trust in her scheme. Determining exactly how trust is produced by such 
communities is elusive, particularly since trust possesses a self-fulfilling 
quality: the existence of trust gives one reason to trust (for both social and 
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transaction cost reasons), just as distrust begets distrust [see Banfield’s (1958) 
excellent study of the obstacles a community faces when distrust is the norm]. 
The form of trust illustrated above is often tied to formal structures (e.g. 
financial cross-holdings and political institutions) and informal sanctioning 
mechanisms. Still, regardless of the institutional scaffolding, the studies 
discussed above demonstrate the ways norms serve as a control mechanism 
for transactions. 

Before moving on to the role of personal relationships in economic life, we 
will briefly mention an argument for the evolution of cooperation which 
avoids norms and is based on an assumption of rational egoists. Many writers 
comment on the role that repeated exchanges play in the emergence of stable 
cooperative relationships. Axelrod’s (1984) imaginative experiment captures 
the essence of the argument employed by many writers. Axelrod found that in 
repeated trials of the prisoner’s dilemma game, the “Tit for Tat” strategy— 
cooperate on the first move and then copy your opponent’s previous move-—— 
outperformed all other strategies. Axelrod argues that the efficacy of this 
strategy hinges on the existence of future transactions: “The future can... 
cast a shadow back upon the present and thereby affect the current strategic 
situation” (1984:12). Rational egoists will cooperate, the argument goes, if 
future transactions offer benefits that outweigh those available by short-run 
opportunistic acts. 

Rarely, however, is the future preordained. The future is instead created on 
the basis of past and present actions. As Zucker (1986: 60) says, “In process- 
based trust, a record of prior exchange, often obtained secondhand or by 
imputation from outcomes of prior exchange, provides data on the exchange 
process.” Axelrod offers an explanation for cooperation that occurs once a 
shared expectation about the future exists; what is missing from his discussion 
is an understanding of how the future emerges—how relationships grow and 
trust develops. 


Personal Relationships in Economic Life 


The social norms discussed above are not the only source of trust in economic 
life. Concrete personal relationships also produce trust. Granovetter 
(1985:491) observes that “networks of social relations penetrate irregularly 
and in differing degrees in different sectors of economic life.” Where eco- 
nomic transactions are embedded in personal relationships the hazards of 
opportunism are diminished and the need for elaborate formal governance 
structures is rendered unnecessary. Ben-Porath (1980) pursues a similar 
argument and proposes that a continuum exists which places the family (high 
trust) at one pole and the impersonal market (low trust) at the other pole. 
Friends, business associates, employers, and employees reside in between the 
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two extremes, and require to varying extents the existence of institutional 
structures to protect against malfeasance. 

Palay’s (1985) study of the relationship between rail-freight carriers and 
automobile shippers is suggestive of the role of personal relationships and 
norms in economic life. Shipping automobiles by rail requires a significant 
investment in specialized autoracks tailored to the design specifications of 
each automobile model. Such a condition predicts a long-term contract or a 
hierarchical relationship—some way to dampen the hazards associated with 
specialized assets. Federal law, however, limited the contractual options 
available to parties. The asset-specific nature of the transaction combined 
with the regulatory restrictions created a problem, since “carriers did not want 
to make transaction-specific investments without shippers’ assurance that they 
would actually use the service” (1985: 163). 

Palay found that parties to such contracts solved this predicament by 
engaging in informal, general, rather unspecific, legally unenforceable agree- 
ments. He reported that the informal agreements were intimately bound up 
with the identities of the actual individuals or groups who made the agree- 
ments. When an individual “signatory” to an agreement moved to a different 
department or retired from the company, the informal contract often became 
the subject of dispute and had to be renegotiated. Palay also observed norms 
similar to those mentioned by Macauley. 

Macauley was also aware of the force of personal relationships in economic 
transactions. The norms he identified (and the informal agreements they 
enabled) were at least partially attributable to the cross-cutting relationships 
exchange partners had in other contexts. Macauley enumerated a variety of 
ways executives in two organizations might know each other: “They might sit 
together on government or trade committees. They may know each other 
socially and even belong to the same country club” (1963 : 63). In an interest- 
ing twist related to personal ties, Macauley found that some businessmen 
objected to elaborate contracts because such planning “indicates a lack of trust 
and blunts the demands of friendship, turning a cooperative venture into an 
antagonistic horsetrade” (1963: 64). 

The relationship between sales agents and their customers is a classic 
example of personal relationships overlapping with business ties. Because 
salespeople are at the boundary of their organization, they sometimes come to 
identify more closely with their customers than they do their own company. 
This fact often leads to the distrust of the salesperson by his or her own 
organization (Webster 1984:53). On top of the ambiguity surrounding organi- 
zational identification, salespeople may also develop personal relationships 
with members of the buying organization. These personal ties solidify the 
relationship between the buying and selling organizations, because they 
(presumably) reduce transaction costs as well as enable individuals to main- 
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tain valued personal relationships (Webster & Wind 1972:15). It is worth 
noting, however, that the companies employing salespeople frequently view 
such. personal relationships as a threat to their business.* 

Identifying the existence of trust and specifying how it is produced is a 
complicated task. Multiple subtle, mutually reinforcing mechanisms provide 
the basis for trust; norms of obligation, recurrent transactions, and personal 
relationships often overlap. Moreover, trust may come to affect exchanges in 
different ways: over time, a price- and authority-oriented exchange may be 
transformed into one characterized by informal arrangements based on trust; 
or, at a given time, trust from one context (personal friendships) may be 
transferable to another context (economic). While more questions than an- 
swers exist on this topic, it should be clear that the social context as man- 
ifested in trust serves as a powerful control mechanism, just as price and 
authority do. 


TRUST, PRICE AND AUTHORITY 


The previous section focused on the role trust plays in controlling exchanges. 
Often, though, trust does not work alone; it is intertwined with authority and 
price. This section begins with an examination of how trust and price play off 
each other. In an inversion of Granovetter’s (1985) embeddedness argument, 
the research reviewed here indicates that trust is sometimes embedded in the 
market. The section concludes with a few comments on the relationship 
between trust and hierarchy. 

In Eccles’ (1981 : 340) study of 26 residential homebuilders, he found that 
relationships between general contractors and subcontractors were “stable and 
continuous over fairly long periods of time and only infrequently established 
through competitive bidding.” Eccles called this bilateral arrangement the 
“quasifirm.” General contractors do not rely on trust alone, however: they 
_ periodically test the market and solicit bids from other subcontractors. The 
fact that relatively low levels of asset specificity characterize these transac- 
tions makes switching subcontractors fairly easy. At the same time, the 
presence of market prices also facilitates the development of trust and enables 
the creation of stable exchange relationships. 

The relationships and dynamics highlighted by Eccles can be found in 
several other industries. Mariotti & Cainarca (1986) paint a similar picture of 


“Granovetter (1985 : 492) makes the important point that “Both enormous trust and enormous 
malfeasance . . . may follow from personal relations.” Malfeasance is tied to personal relations in 
two main ways. First, trusting a person may leave you more vulnerable to their opportunistic 
behavior (although the opportunistic actor probably has only one chance to benefit from this 
situation.) Second, malfeasance is often dependent on the efforts of a group, which is bound 
together by trust. 
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the organization of general contractor and subcontractor relationships in the 
textile-clothing industry in Italy. Similarly, the investment banking industry 
displays this dynamic (Eccles & Crane 1988). While the industry is character- 
ized by strong relationships between bankers and companies, a second tier of 
banks is always in the wings that companies can and do turn to if they are not 
getting the service they desire. 

The trust—price control mechanism has recently received increased atten- 
tion. Several contemporary observers of organizational life have noted (and 
advocated) the use of. loose collections of independent firms, tied together by 
mutual understandings, to produce and market products (Miles & Snow 1986, 
Johnston & Lawrence 1988; Powell 1987 provides a useful review of work in 
this area). These confederations are sometimes born out of the disaggregation 
of fully integrated firms (Dore 1983, Mariotti & Cainarca 1986), and some- 
times they emerge out of the reconfiguration of former market relationships 
(Powell 1987, Johnston & Lawrence 1988). In almost all of these studies, 
mutual dependence between exchange partners is said to promote trust. This 
contrasts sharply with the argument central to transaction cost economics that 
bilateral:monopoly (i.e. dependence) fosters opportunistic behavior (William- 
son 1985). Mutual dependence, however, is rarely complete in these situa- 
tions: two or three suppliers may. replace ten or fifteen, for example. Options 
remain. Trust and price are joined. 

Several of the examples mentioned above, although certainly not all, 
pertain to project-based industries where uncertainty is high and asset 
specificity is low: switching is relatively easy. There is reason to believe, 
however, that switching is a credible threat even when it is economically 
irrational—e.g. even when asset specificity is high. Dore (1983) observed 
that Japanese . trading-partners were willing to abandon long-term rela- 
tionships when the other side was perceived as “not even trying” to match the 
best buy available in the market. A breach of trust may push people back into 
the market, even if it is costly to themselves; and that prospect may inhibit the 
breach in the first place. Indirect evidence for this proposition is offered by an 
experiment reported by Kahneman et al (1987). They found that people 
responded to violations of norms of fairness with a form of what we interpret 
as “mutually assured destruction’—that is, violators were punished even if 
such an action imposed an irrationally high cost on the enforcer.> Lewis & 


>The experiment was as follows. Person A is given $10 dollars-and told to divide it between 
themselves and an anonymous person R. Person R then has the option to accept or reject the offer. 
If R accepts the offer, the money is divided according to A’s plan; if R rejects it, they both receive 
nothing. The “rational” solution to this problem is for A to offer a token payment and for R to 
accept it. Neither prediction was bome out in the experiment. On average, person A offered 
between four and five dollars. More important for the argument here was the finding that on 
average the R’s rejected splits below $2.00—2.59 (there were three experimental groups.) 
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Weigert’s argument (1985) that trust contains an emotional component might 
help to explain the sense of betrayal and the accompanying retaliatory be- 
havior that Kahneman et al found when trust was violated. 

While this section has emphasized price-trust mechanisms, authority and 
trust also are mixed in the empirical world. One of the greatest contributions 
sociology has made to our understanding of organizations is its illumination of 
informal groups. These trust-bound groups are sometimes provoked by au- 
thority and serve to impede production (Roethlisberger & Dickson 1939, 
Burawoy 1979), while at other times they may work with authority to govern 
transactions (Walton & Hackman 1986). Although it is beyond the scope of 
this article to delve into the mammoth literature on informal groups, it is 
important to note that trust and authority are also mixed. 


PLURAL FORMS 


The previous sections show how viewing price, authority, and trust as sepa- 
rate but often intertwined control mechanisms enhances our understanding of 
inter- and intra-organizational control. For the most part, this perspective 
emphasizes the ways individual transactions and individual relationships are 
controlled. Such a micro focus, though, misses subtle and important features 
of the macro structure which affect individual transactions. 

This final section examines how transactions are embedded in a context of 
other transactions, as well as in a social context. Specifically, we consider 
“plural forms”—an arrangement where distinct organizational control mech- 
anisms are operated simultaneously for the same function by the same firm. 
For example, companies often make and buy the same part; companies 
frequently franchise units and own units in the same restaurant or hotel chain; 
and companies sometimes use a direct sales force and third party distributors. 
As these examples indicate, plural forms are comprised of two distinct control 
mechanisms. In the case of making and buying parts, authority and price are 
juxtaposed; with franchise units and company-owned units, a hybrid price- 
authority mechanism (franchise) is coupled with an authority mechanism. 
While most investigations of these mechanisms have focused on explaining 
how transactions are assigned to one mechanism or the other (typically by 
establishing which mechanism is most efficient), this section emphasizes how 
transactions controlled by one mechanism are profoundly affected by the 
simultaneous use of an alternative control mechanism. 

The forces at work in plural forms are alluded to in Walker & Weber’s - 
thorough study (1984) of make-or-buy decisions in the automobile industry. 
They estimate a structural equation model to test the impact of Williamson’s 
(1975) transaction cost variables on the decision to make or buy 60 different 
parts. They -hypothesized that buyer experience in producing a part has two 
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effects which push the make-or-buy decision in opposite directions. First, 
buyer experience may result in a production cost advantage for the buyer over 
the supplier. Second, and conversely, “buyers with a history of producing a 
component have better information about manufacturing it, and suppliers are 
thus less able to engage in opportunistic bargaining” Walker & Weber 
1984:377. Although the estimates for these paths did not reach the critical 
significance levels, they did have the predicted signs and the underlying logic 
of the relationships appears sound. Walker & Weber speculate that perhaps 
the relative simplicity of the parts under study dampened the expected effects. 

Walker & Weber take their argument one step further and propose a pattern 
of make-or-buy decisions which might extend over the life of a part. They 
suggest that: 


a cyclical pattern for complex components may be found in which components are brought 
into the firm so it can gain production experience and reduce uncertainty and then are 
shifted back to the market when contracting hazards can be managed. (1984:389) 


The same benefits can be obtained by operating the two forms si- 
multaneously and playing them off one another. When a company makes as 
well as buys, it not only possesses information it can use to manage the 
subcontractor (or franchisee), but the ideas and innovations of the sub- 
contractor can be brought to bear on the company’s business. Harrigan 
(1984:643) applies the term “taper integration” to a similar organizational 
form: “when firms are backward or forward integrated but rely on outsiders 
for a portion of their supplies or distribution.” She emphasizes the role such 
arrangements play in balancing economies of scale in the value chain and in 
reducing a company’s vulnerability to strikes and shortages; she also recog- 
nizes that such forms open up a new avenue of information about competitors’ 
activities. 

A hidden dimension of the plural form, however, and one we believe is 
crucial, is the control it affords to the managers of companies. They can use it 
to employ a racheting strategy where management and technical innovations 
generated by one mechanism can be borrowed and utilized by participants 
controlled by the other mechanism. Relying solely on an arm’s-length con- 
tractual arrangement or on an authority structure presents obvious difficulties. 
As Weber & Walker (1984) point out, contracting is problematic without 
in-house experience, and the maladies associated with hierarchy are widely 
recognized. The remedy for these difficulties may be the simultaneous use of 
the two mechanisms, creating in essence competition between them. 

Another way to view the control offered to managers by the plural form is 
suggested by White (1985). In a thought-provoking article about agency 
relationships, White (1985 : 209) argues that one of the prime difficulties with 
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such relationships is that they often become specialized, and this allows 
control to slip from the hands of the principal into the hands of the agent: 
“Specialization yields some measure of autonomy, which measure is curtailed 
by the use that authority makes of continuing realignment of boundaries.” The 
existence of a viable alternative mechanism may make boundary shifting 
relatively easy. l 

Franchising is an excellent example of the plural form. Almost all franchise 
systems are composed of company-owned units and franchise units. The 
company-owned units: are managed in a hierarchical authority structure. 
Within this authority structure, profit centers and management incentive 
programs introduce some elements of a price control mechanism. In contrast, 
the franchisee-owned units are managed by independent businesspeople who 
have signed long-term contracts with the company. The franchisee usually 
pays a flat fee up front and then pays a fixed percentage of revenue generated 
by the unit to the company: in the language of agency theory, the franchisee is 
the residual claimant of the revenue (and profit) stream. It is important to note 
that franchisees are not fully independent entrepreneurs; hierachical authority 
governs many of their activities. The contracts franchisees sign typically 
specify in great detail the manner in which the franchise is to be run in order to 
protect the value of the trademark. 

These two distinct control mechanisms are used by most companies en- 
gaged in franchising. For example, McDonald’s owns 2280 of its units and 
franchises 7274 units; Pizza Hut owns 2660 and franchises 2603; Sheraton 
Inns owns 148 hotels and franchises 335; Holiday Inns owns 204 hotels and 
franchises 1420 (“Ninth Annual Franchise 500,” Entrepreneur, January 
1988). Soft drink bottlers, food stores, and a variety of service businesses, 
among others, follow this same plural form structure. 

Explanations for the plural form in franchising have focused on the differ- 
ing abilities of company-owned and franchised units to inhibit opportunism 
and to operate efficiently (Caves & Murphy 1976, Rubin 1978, Brickly & 
Dark 1987). The assumptions guiding this work are that company-owned 
units provide to the company the entire profit stream, but such units suffer the 
liability that managers will shirk unless monitored. In contrast, franchisees, it 
is argued, operate efficiently because they have a piece of the pie, although 
they too must be monitored to ensure they do not degrade the trademark, or 
they must be located in places with repeat customers so that they internalize 
the costs of their opportunistic actions. While these explanations certainly 
have some merit, they are able to explain only a modest amount of the 
variance in these forms. The other commonsense explanation—that the need 
for capital early in a company’s life leads to franchising—is belied by the 
large number of financially vigorous firms that continue to franchise, as well 
as by the fact that acquiring capital from the relatively undiversified portfolios 
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of prospective franchisees may be an expensive way to generate funds (Rubin 
1978). 

While the need for control does shape the structure of the franchise system, 
to view it on a transaction-by-transaction basis is to conceive it too narrowly. 
The lack of success in explaining the organizational structure of franchise 
companies results from three related misconceptions: (a) that for every site 
one form is in fact superior to another; (b) that companies select a site, decide 
on the control mechanism, and promptly find a qualified manager/owner, and 
(c) that control is transaction specific. 

Instead, we hypothesize that in many cases either mechanism will work and 
the choice of mechanism is primarily a function of the vagaries of circum- 
stance—who comes up with the idea for a new site, who has cash, whether 
qualified managers are available, and so forth. Accordingly, interpreting the 
choice of a control mechanism for a specific site as a rational response to 
efficiency or control considerations may reveal little about the dynamics at 
work in these structures. While much more research needs to be done on these 
structures, we suspect that the contours of these systems (e.g. the number of 
franchise and company-owned units) are a product of microlevel decisions 
(which may be fairly idiosyncratic) and macrolevel strategic objectives which 
are at least in part driven by senior management’s recognition of the indirect 
control afforded by plural forms. 

The powerful indirect effect of the alternative mechanism on controlling the 
transaction follows a logic similar to that underlying the plural form make-or- 
buy mechanism discussed earlier. Franchises may be motivated to find more 
efficient operating procedures (which then can be employed by company- 
owned units), while company-owned .units may provide information that 
enables the company to negotiate with and control franchisees effectively. 
Furthermore, the boundaries between mechanisms are fluid—the annual fran- 
chise turnover rate (conversions to and from franchise units, plus franchise 
failures) is around 10% (Walker & Cross 1988); this almost certainly plays a 
part in controlling individual transactions.° 

Make-or-buy decisions and franchises are only the most obvious examples 
of the plural form. Along similar lines, many companies simultaneously 
employ a direct salesforce and utilize third party distributors (Anderson & 
Schmittlein 1984). There is anecdotal evidence that state and local gov- 
ernments are finding arrangements that mix public and private services more 
effective than those that rely fully on privatized services (Uchitelle 1988). 


“It should be noted that franchises not only exhibit the plural form, they also present nested 
control mechanisms. Multi-unit franchisees—where a franchisee owns more than one unit, which 
means that at least some of the units are run by salaried managers—are growing in popularity (US 
Department of Commerce, Franchising in the Economy 1985-1987, p. 5). This form essentially 
establishes hierarchical authority mechanisms within price-authority mechanisms (franchises). 
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These examples beg for an analysis of the dynamics that accompany the 
transactional architecture taken as a whole. In many cases, microanalytic 
approaches miss important features of these structures. 


CONCLUSION 


The ideal types of market and hierarchy serve as a useful starting point for 
studying the organization of industry. The assumption that these mechanisms 
are mutually exclusive, however, obscures rather than clarifies our un- 
derstanding. As this review has shown, price, authority, and trust are com- 
bined with each other in assorted ways in the empirical world. 

The combinations of control mechanisms discussed in this review have 
been characterized as overlapping, embedded, intertwined, juxtaposed, and 
nested. These descriptions were not intended as strict categories, but instead 
suggest that researchers need to embrace the subtlety and complexity of 
economic and social control mechanisms. Much of the complexity results 
because only occasionally are control mechanisms created on “greenfield” 
sites. Typically, control mechanisms are grafted onto and leveraged off of 
existing social structures. The installation of price mechanisms in hierarchies 
(Eccles 1985) and the functioning of business relationships in industrial 
communities (Sabel & Zeitlin 1985) are examples of this. 

The most sophisticated mixture of control mechanisms can be seen in the 
plural form. Here, two distinct control mechanisms are operated si- 
multaneously by the same company. To understand the plural form, the 
analytic focus must be moved away from exclusive attention to individual 
transactions; instead, the dynamics of whole structures must be examined 
since the transactional context affects the control that can be brought to bear 
on individual transactions. It is worth noting that matrix structures are a close 
cousin to the plural form. They are similar in that both are built on distinct 
control mechanisms, with matrix structures based on dual authority mecha- 
nisms; they differ because in matrix structures the mechanisms intersect while 
in plural forms the mechanisms run in parallel. In both cases, individual 
transactions are inextricably tied to the larger architecture of control mech- 
anisms. 

This article has only begun to sketch the dynamics of these complex 
organizational arrangements. Explanations for when and why different mix- 
tures of control mechanisms occur need to be developed. Such explanations 
may end up consistent with existing theories such as transaction cost eco- 
nomics. Little progress will be made, however, unless we step away from the 
theoretical baggage which insists that we view markets and hierarchies as 
mutually exclusive alternatives or ends of a continuum and replace that view 
with the recognition that human reason and social circumstance lead to much 
more complex forms of control. 
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Abstract 


Recent research on collective action has focused on the occurrence, timing, 
and sequencing of such events as regime changes, riots, revolutions, protests, 
and the founding of social movement organizations. Event analysis allows 
information on the duration, number of participants, presence of violence, or 
outcome of some particular type of collective action to be compared across 
social systems or across time periods. This review considers issues of defini- 
tion, measurement, and methods of estimation in event analysis. It also 
compares two general varieties of event analysis: approaches that model the 
dynamics of collective action as a process, and those that do not. A process- 
oriented approach evaluates how time and covariates (including past events) 
affect the timing and sequence of repeatable events, and it attempts to explain 
how events unfold over time. The nonprocess approaches summarize static 
relationships between levels or characteristics of units and some type of event 
count. 


INTRODUCTION 


Sociologists interested in the causes of social movements and their success or 
failure have long considered natural histories of movements and revolutions, 
profiles of activists, and catalogues of grievances. Recently, spurred mainly 
by Charles Tilly’s pathbreaking research, they have begun to focus on the rate 
of occurrence of collective events. This chapter reviews and evaluates 
methods for using information on events—especially their timing and se- 
quences—to analyze social movements and collective action.! 


'See Jenkins (1983), Marwell & Oliver (1984), Zald & McCarthy (1979), and Tarrow (1988a) 


for recent substantive reviews of theory and research on collective action. 


119 
0360-0572/89/08 15-0119$02.00 


120 OLZAK 


Event analysis has an extremely broad scope. Studies of conflicts have 
included race riots (Spilerman 1970), strikes and industrial protests (Amin- 
zade 1984), political violence (Tilly, Tilly & Tilly 1975), peasant rebellions 
(Paige 1975), revolutionary activity (Markoff 1986), lynchings (Inverarity 
1976), and coups d’état (Hannan & Carroll 1981). Studies of protest have 
considered black civil rights’ protests (McAdam 1982), marches against 
nuclear power plants (Walsh & Warland 1983), Québecois separatist demon- 
strations (Olzak 1982), union organizing of farm workers (Jenkins & Perrow 
1977), protests by American Indians (Nagel 1988), and social protests in Italy 
(Tarrow 1988b). Studies of foundings and disbandings of social movement 
organizations have included such varied assemblies as labor unions (Hannan 
& Freeman 1987), anti-drunk driving groups (McCarthy et al 1988), and civil 
rights’ organizations (Jenkins & Eckert 1986). 

Event analysis allows diverse forms of collective action to be measured and 
compared because observations are collected in commensurate dimensions. 
Thus, information about revolutions consisting of thousands of acts of vio- 
lence and about one-time confrontations is enumerated and described in terms 
of number of participants, duration of unrest, magnitude of violence, and 
other characteristics. This feature of the strategy is important because it 
permits cases of unsuccessful and/or short-lived movements to be compared 
with instances of successful and long-lived ones. It also allows stages that 
emerge during collective actions to be analyzed as a sequence of different 
kinds of events rather than as one continuous action. In short, the strategy of 
studying events uses more information about the dynamics of change in social 
movements than do strategies that treat movements as unitary. phenomena. 


THE DEVELOPMENT OF EVENT ANALYSIS 


Event analysis took shape during the 1960s in research on variation among 
countries in collective violence and on causes of race riots. Subsequent 
advances in methods for analyzing historical and time-ordered data on events 
have increased the power of event analysis and encouraged research on classic 
questions about the influence of timing and organizational dynamics on rates 
of collective action and on the success of social movements. 


Brief History of Event Analysis 


Cross-national research on collective violence in the 1960s related annual 
counts of events to economic, political, and other structural variables to learn 
whether peak periods of economic hardship were peak periods of unrest. 
These studies developed formalized rules for coding information on collective 
events using records from archives, newspapers, historical documents, and 
police or magistrate records (for a recent example, see Taylor & Jodice 1983). 
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They also produced data banks that allowed researchers to compare effects of 
structural and political variables on classes of events among nations and over 
historical periods (Rule & Tilly 1965, Feierabend & Feierabend 1966, 
Feierabend et al 1972). During this period, research moved beyond describing 
and categorizing stages in conflicts to analyze causes of unrest. This shift had 
two important consequences. First, use of event counts opened up the 
possibility that diverse social science methods could be used to analyze 
questions about social movements. Second, the substantive findings informed 
theoretical debates about social upheavals. In particular; theories of social 
protest that highlighted mechanisms of relative deprivation and social dis- 
organization unraveled once they were subjected to empirical scrutiny (Sny- 
der & Tilly 1972, Rule & Tilly 1972, a Tilly & Tilly 1975, Tilly 1981, 
Weede 1987). 

A concurrent line of research related demographic and socioeconomic 
characteristics of communities to counts of race riots. This research also 
informed theoretical debates about civil unrest. The findings challenged the 
assumption that communities experiencing greatest economic hardship, 
poverty, and relative and absolute deprivation were most likely to experience 
race riots (Lieberson & Silverman 1965, Wanderer 1969, Spilerman 1976, 
Eisinger 1973, McPhail & Wohlstein 1983). 


Motivations for Event Analysis 


Choice of a method for analyzing data on collective action depends upon a 
researcher’s purposes. If the goals involve testing generalizable theories with 
replicable evidence, then event analysis has several potential advantages. 
First, as noted above, event analysis can use statistical techniques and models 
that have been useful in other subfields of sociology and other sciences. 
Hypotheses can be tested and models can be compared on their relative merits 
(Spilerman 1971, Paige 1975). Although case studies provide rich detail and 
historical background, they rarely provide systematic evidence appropriate for 
testing hypotheses. For example, social scientists. commonly study only the 
places that have collective events in studies that try to learn the causes of 
events. For instance. Wanderer’s (1969) analysis of riot severity used data 
only for cities that had race riots, thereby ignoring characteristics of cities that 
did not have a riot and sidestepping the question of why riots occurred. 
Sampling on the dependent variable, as in these examples, confounds causal 
interpretation because (by definition) some observations have been excluded 
from analysis based upon their levels on the dependent variable (success or 
not, participation or not). Any attempt to infer causal relationships will be 
crippled by sample selection bias (Heckman 1979, Berk 1983). Sample 
selection bias has often been confused with the issue of representativeness of 
samples. They are not the same. For example, data on events occurring in 
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Utah may be questioned on the grounds that the sample is not a random one 
and because Utah is not representative of other geographical regions. If the 
goal is to estimate some mean number of events in the country as a whole, 
then using this sample will cause problems. If the goal is to estimate the 
effects of some covariates on the rate of activity, the estimates from such a 
sample may be unbiased. This is because nonrepresentativeness hampers 
one’s ability to estimate population means and variances but does not neces- 
sarily affect estimates of structural parameters, e.g. effects of covariates on 
the rates. Sample selection bias on the other hand means that a sample has 
been chosen based upon some value of the dependent variable, and effects of 
covariates on the rates will be biased unless appropriate corrections are 
applied (Heckman 1979). 

An important motivation for event analysis is thus to avoid sample selection 
bias. Research designs in event analysis avoid problems of sample selection 
bias by including both successful and unsuccessful movements (Gamson 
1975), cities with and without racial conflicts (Spilerman 1970), and samples 
drawn from populations of participants and nonparticipants (McAdam 1986). 

Furthermore, case studies of social movements normally do not exploit 
information on timing and sequences of events. One important form of event 
analysis, event-history analysis, does analyze such information. This is par- 
ticularly important in studying processes of temporal contagion in the spread 
of riots, rebellions, and other disturbances. 

Some have suggested that the spread of unrest by geographical contagion 
may also be important. In one case, consideration of spatial contagion altered 
substantive conclusions: Doreian (1981) found that spatial autocorrelation 
affected estimates of factors causing local variations in participation in the 
Huk rebellion in the Philippines. Investigating the intersection of social 
geography and diffusion holds promise for examining how proximity affects 
contagion (Markoff 1985). 

Whether the occurrence of collective actions means that a social movement 
exists is treated as problematic and open to empirical examination in event 
analysis. This stance anticipates the possibility that events might occur after a 
revolution has ended or before a social movement supposedly begins. This 
strategy stands in contrast to traditional ones that analyze social movements 
only within conventional temporal boundaries defined by the history of some 
movement or revolution. 

Restricting research to a priori temporal units of revolution, civil war, or 
social movements may miss much variation in activity over time and over 
regions. For instance, Tilly, Tilly, & Tilly (1975: Figure 6) show that the 
number of collective disturbances in France has four peaks during 1830-1930: 
disruptions of the revolution of 1830, the unrest surrounding the revolutionary 
activity in 1848, strikes around 1906, and social unrest during the mid-1930s. 
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What surprised even the authors was that the four peaks contained comparable 
numbers of violent events. Apparently other “revolutions” of equal signifi- 
cance occurred in addition to that in 1830. 

- Event analysis also allows empirical investigation of changes in forms of 
collective action. For example, Tilly’s (1986) analysis of French contentious 
gatherings reveals that collective actions were transformed over a hundred 
years from local to national scope. This approach also allows forms of conflict 
to be analyzed separately by target and degree of violence, facilitating ex- 
amination of similarities and differences in effects of covariates on types of 
events. In other words, claims that causes of various types of collective action 
differ can be examined empirically, rather than assumed. Causal links be- 
tween forms of collective action can also be examined. For instance, there has 
been much speculation but little evidence about the link between strikes and 
racial violence; some suggest that the occurrence of strikes (and the use of 
black strikebreakers) led to racial violence. Olzak (1989) found that growth of 
labor unions—but not strike frequency—raised rates of violence against 
blacks but decreased rates of violence against white immigrants in late 
nineteenth and early twentieth century America. 

Another source of the increasing popularity of event analysis is its flexibil- 
ity in using different levels of spatial aggregation. Events have been studied as 
products of groups, neighborhoods, electoral districts, export sectors, and 
nation-states. Consider the case of American race riots during the 1960s. 
Stark et al (1974) focused on geographical spread during one year of race 
rioting over census tracts of one city, Los Angeles. Spilerman (1970) anal- 
yzed data on the annual occurrence of riots in numerous cities in order to learn 
whether characteristics of cities affect the probability of a riot. Still others 
analyzed yearly counts of riots for the United States as a whole in order to 
learn whether they fluctuated with variation in the national political and 
economic environment (Kelly & Isaac 1984). 


Promising Developments in Event Analysis 


Researchers have built on the studies of the 1960s and have expanded the 
repertoire of methods for collecting and analyzing events. Two areas deserve 
special note. The first concerns life histories of social movement organiza- 
tions (SMQs). Gamson (1975) and McCarthy & Zald (1977) proposed a 
research agenda that emphasized the importance of gathering information on 
such organizations. But this agenda was not quickly implemented. Until 
recently only Steedly & Foley (1979) and Goldstone (1980) analyzed SMOs 
directly (and both used Gamson’s data). This lack of attention to organiza- 
tions is curious given the centrality of organizational structure in resource 
mobilization theories of collective action. However, the situation seems to be 
changing. Hannan & Freeman (1987, 1989), McCarthy et al (1988), and 
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Carroll & Huo (1988) have studied the founding and disbanding rates of 
SMOs, and Rosenthal et al (1985) studied network relationships among 
organizations in the women’s movement. 

A second promising line of research investigates “cycles” or waves of 
protest empirically. Beginning with Durkheim’s analyses of social disorgan- 
ization, the idea that social movements come in recurrent patterns has in- 
trigued sociologists. Although waves, cycles, and episodes serve as core 
concepts in theories of collective behavior (Turner & Killian 1957), few 
researchers have studied such patterns. Hamblin et al (1973) suggested that 
mathematical models of contagion could be applied to analysis of diffusion of 
collective events. Only Spilerman (1971) and Olzak (1987a) have modeled 
processes of contagion directly. 

Both developments—analysis of organizational events and analysis of 
trajectories of diffusion—highlight the importance of gathering temporal data. 
Theory and research now attend to the effects of the histories of social 
movements (and their internal organizational developments) and of sequences 
of protests and conflicts on the likelihood of future events. 


DEFINITIONS IN EVENT ANALYSIS 


Conceptual Definitions 


This research tradition commonly defines events as nonroutine, collective, 
and public acts that involve ‘claims on behalf of a larger collective (Paige 
1975, Tilly 1978). Thus a minimal definition of collective action is that it (a) 
involves more than one person, and (b) makes claims of agency (or corporate) 
status. Tilly’s (1978) list of claims includes petitioning, memorializing, and 
opposition to or support of an enemy of the government. Tilly (1978: 275) 
defines agency status as involving acts that “make a visible claim which, if 
realized, would affect the interests of some specific person(s) or group(s) 
outside their own number.” 

Should social movements and events be defined independently? If so, what 
should be the distinguishing criteria? According to the minimal definition, 
social movements and collective actions are separable. Social movements are 
“a group of people identified by their attachment to some particular set of 
beliefs” (Tilly 1978: 9). Conceptual definitions of collective actions normally 
do not require the presence of shared beliefs among actors. However, some 
researchers stipulate that collective actions involve groups with preexisting 
solidarity (e.g. Paige 1975: 90).? Others implicitly assume that solidarity is 


7In the mid-1970s' there was probably less consensus about what distinguished political 
collective actions from panics, fads, etc. To test theories about political collective actions, some 
distinctions among types of collective behavior were necessary. Thus Paige’s motivation for 
including solidarity was therefore to exclude collective behaviors such as “panics, mass move- 
ments of refugees, crazes . . .” (1975: 87, 390). 
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necessary for the types of collective actions they have analyzed, but leave 
open the issue of whether solidarity is necessary in other instances. 

Some researchers also stipulate that collective action be noninstitutional 
(Paige 1975), while others take this position implicitly by limiting analysis to 
actions instigated by challenging groups that are “countercultural” or that 
have little formal power (Kriesi 1988). Perhaps because it implies a process, 
institutionalization has proved to be difficult to operationalize. Groups that 
have already gained acceptance in the political arena are thus often excluded 
from analysis of social movements and collective action. Lowi (1971) pro- 
vided a rationale for this practice, claiming that SMOs that succeed in 
establishing routinized and stable ties with a government agency have been 
transformed into interest groups. In this view, strikes by most labor unions are 
now actions of-interest groups, not social movements (Snyder 1975). Using 
Lowi’s criteria, actions of some “countermovements” could be excluded, 
despite the fact that research on them has illuminated ways that members of 
groups with access to power sometimes form significant social movements. 
Examples include studies of the pronuclear power movement (Useem & Zald 
1982), the antiabortion movement (McCarthy 1987), and coups d’état by 
dissatisfied army units (e.g. Feierabend & Feierabend 1966). 

Tilly takes a different tack. He defines collective action to potentially 
include actions of all contenders, including both “members,” who are those 
who have “routine, low-cost access to resources controlled by the govern- 
ment,” and “challengers,” who are other contenders (Tilly 1978: 52). If, as 
Tilly suggests, both members of the polity and outsiders can conduct social 
movements, then we should analyze transformations of outsiders into mem- 
bers of the polity as events in the life histories of social movements. Such 
analysis requires attention to the boundaries that separate insiders from outsid- 
ers. 

But there is disagreement about whether institutionalization of social move- 
ments belongs in the study of collective action. Some researchers argue for 
excluding institutional behavior because it concerns elite activity rather than 
spontaneous grass-roots mobilization (Jenkins & Perrow 1977, McAdam 
1982). Others think that it is unreasonable to truncate observations on SMOs 
at the point when they gain access to the political arena. Such exclusion 
assumes that institutionalization is a one-way process and ignores the fact that 
some established groups become delegitimated and excluded from the polity. 
Excluding institutional action from studies of collective action by definition 
makes it impossible to address a fundamental sociological process: how 
charismatic movements become routinized and institutionalized over time. 

If we add institutional activities to the list of potential collective actions, we 
face the prospect that the concept encompasses virtually all social behavior 
including everyday actions such as business conferences, professional meet- 
ings, and church assemblies. Tilly’s (1978) emphasis on excluding routine 


126 OLZAK 


activity has been a key solution to. reduce the scope. There are often good 
reasons to exclude anniversary marches or annual celebrations since their 
timing is predetermined and not spontaneous. Researchers also usually ex- 
clude routine congressional activities, court sessions, political party conven- 
tions, and regular government activity from the concept of collective action 
(Burstein 1985). According to Tilly’s guideline, routine church activities 
would not be coded as events, but nonroutine actions of church members 
participating in civil rights’ marches under the leadership of a minister might 
be. 

The minimalist definition of collective events has advantages over defini- 
tions that include dimensions such as size, noninstitutional status, violence, 
and political orientation as defining features of collective action. Relaxing the 
requirement that collective action involves self-conscious identity, solidarity, 
or shared sentiment allows researchers to examine whether solidarity affects 
rates of collective action. In contrast, enumerating only events that reflect 
solidarity vitiates the possibility of studying the effects of variations in 
solidarity. It renders tautological the key claim of resource-mobilization 
theory that solidarity affects collective action. So it appears that a sensible 
strategy in event analysis 1s to begin with minimal definitions of events so that 
the effects of solidarity, institutionalization, and other dimensions can be 
examined empirically. 

Use of the minimal definition has a disadvantage. Coding costs increase 
with the scope of relevant actions. In addition, effort is needed to narrow the 
focus of the research later rather than earlier in the research process. But in 
many cases, the capacity to narrow the scope, or to compare different forms 
of events, after the data are collected may outweigh these costs. 


Operational Definitions 


At what size does an event becomes collective? The answer varies widely 
among studies and types of event. Most researchers choose a minimum in the 
range of 10 and 20; Tilly (1978) proposed a minimum of 50, and Olzak and 
West (1987) use a minimum of 2. But sources of data often lack information 
on exact numbers. Olzak (1987) found that daily accounts of ethnic protests in 
the New York Times provide reliable counts of participants for only about half 
of the events. Fortunately, newspaper reports nearly always contain key 
phrases such as “small group of hooligans” and “large crowd” ‘that indicate 
approximate size categories. 

Most researchers hold that events occur in one local setting, although some 
Civil Rights campaigns and intercity marches are exceptions to this rule 
(Morris 1984). But geographical units change over time, and information is 
sometimes found in incompatible units. Markoff & Shapiro (1973) discuss 
solutions for problems that occur when geographical units change over 
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time because boundaries of census, political, electoral, and other kinds of 
districts change. 

Decisions about temporal boundaries of events are especially complicated. 
When does one event stop and a new one begin when events are closely 
spaced in time? Paige (1975), Tilly (1978), and Olzak (1987) judge a new 
event to have begun if activity resumes after a pause of at least 24 hours in a 
locale. Otherwise, activity of more or less continuous actions by the same 
group is considered as one event. 

Marwell & Oliver (1984) argue that “collective campaigns” are more useful 
units than are events. Certainly the concept of a campaign can be useful, as in 
the case of the Civil Rights’ Movement in which marches carried across state 
boundaries. However, the concept of campaign also faces serious problems. 
When does an event become a campaign? This would be a problem for an 
abortive campaign or an isolated event that might have begun a campaign but 
failed. And, when does a campaign end? Just as smaller geographical units 
are more easily aggregated to larger entities than vice-versa (Markoff & 
Shapiro 1973), a strategy of analysis (rather than measurement) that specifies 
how to link events that make up a campaign may be more useful than one that 
begins with observations on campaigns. Moreover, collapsing brief episodes 
of many events into one campaign makes the occurrence of many small events 
indistinguishable from one uninterrupted protest. Such a coding strategy 
makes it difficult to discover cycles of protests. 

It is often important to measure variation in intensity or magnitude of 
events. Some researchers combine information on numbers of participants 
and duration of events in order to measure intensity. One common approach 
weights events by their duration (in days) multiplied by the average number of 
participants per day, thereby measuring events in so-called person-days (Rule 
& Tilly 1965, Hibbs 1976, Olzak 1982, Rasler 1986). 

Much research on collective action focuses on collective violence. Exactly 
what constitutes a violent event? And whose violence provides the bench- 
mark—protesters, police, or outsiders, or all three? Again operationalizations 
vary widely. A violent event has been defined as “an instance of mutual and 
collective coercion within an autonomous political system which seizes or 
physically damages persons or object? (Tilly 1978: 248). Others treat events 
as violent if weapons or attacks are’ used merely to threaten property or 
persons (Olzak 1989). Political scientists have measured political violence in 
terms of the number of deaths from political violence (Muller 1985, Weede 
1987). Categories used in coding violence and size are sometimes in- 
terdependent. In some coding schemes, occurrence of violence affects 
whether or not an event is taken regardless of size. For example, all violent 
events are included in some studies of contentious gatherings and protests 
regardless of size (Tilly 1978, Tarrow 1983). 
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DATA AND MEASUREMENT 


Those who analyze quantitative data on events sometimes use data and 
methods without questioning their drawbacks and limitations (Franzosi 1987). 
This section reviews some disadvantages, problems, and constraints encoun- 
tered in event analysis and some proposed solutions. 


Sources and Methods of Data Collection 


Collection of information on date of occurrence, duration, participants, tar- 
gets, and violence relies increasingly on: 


a. official archival records, such as police records and municipal records of 
arrests and/or deaths from civil. unrest (Tilly, Tilly, & Tilly 1975); 

b. annual newspaper indexes (Spilerman 1970, McAdam 1982, Burstein 
1985, Jenkins & Eckert 1986); 

c. daily newspaper accounts (Paige 1975); 

d. published listings of events originally compiled from various newspaper 
sources, ¢.g. the widely used data on strikes in the United States (Griffin 
1939, US Bureau of the Census 1975), lynchings (NAACP 1919, US 
Bureau of the Census 1975), and political turmoil (Taylor & Jodice 1983); 
and 

e. Organizational histories in newsletters or other secondary historical 
accounts (Griffin 1939, Hannan & Freeman 1987, 1989, McCarthy et al 
1988). 


Much research reviewed here has come to rely primarily on newspaper 
reports in part because such sources can be followed forward in time and 
because at least some researchers have found that newspapers provide the 
most complete account of events for the widest sample of geographical or 
temporal units (Tilly, Tilly, & Tilly 1975: 16). Research on the United States 
has treated the New York Times (NYT) as a source for the entire country and 
much of the debate has focused on this source. 

Reliance on newspapers as a primary source of data has its critics. The most 
common criticism leveled at this approach is the suspicion that the data 
contain some systematic biases (Johnson et al 1971, Chermesh 1982, Franzosi 
1987). For example, Danzger (1975) finds that the location of. UPI and AP 
wire offices affected coverage of riots in the NYT, Kielbowicz & Scherer 
(1986) claim that more violent, larger, or longer events are more likely to be 
covered in newspapers and that coverage affects subsequent events. Un- 
fortunately there is rarely a way to evaluate these claims directly since few 
alternative sources contain as much information. However, Snyder & Kelly’s 
(1977) analysis comparing coverage in local and national newspapers shed 
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light on these issues. They compared accounts of riots in daily NYT accounts 
with those from local newspapers in 43 American cities from 1965 to 1969. 
They found that the location of offices of major. wire services was not related 
to the likelihood that an event in a local newspaper would be reported in the 
NYT but that conflict intensity, measured in terms of size, violence, and 
duration of an event did. So Snyder & Kelly (1977) recommended that some 
threshold of the number of participants in events should be used, or that 
events of similar intensity should be analyzed separately when using data 
from the NYT. For problems related to geographical bias that arise when, for 
instance, only the NYT is used, Franzosi (1987) suggests that some alternative 
source—preferably a local newspaper—be used as a supplement. However, 
the NYT remains the most complete source of data on events, according to 
analyses that compared coverage in the NYT with the Los Angeles Times and 
other west coast newspapers, for data on collective actions that occurred only 
on the west coast (Jenkins & Perrow 1977). 


Instruments for Collecting Data on Events 


Charles Tilly and his associates remain the leaders in providing methods and _ 
techniques for collecting information on events of political and contentious 
actions (Rule & Tilly 1965, ‘Tilly et al 1975, Tilly 1978, Tilly 1986, Horn & 
Tilly 1986). Others have adapted his approach for other kinds of events. 
Especially detailed coding appendices and technical reports include those by 
Gamson (1975), Tilly (1978), McAdam (1982), Paige (1975), Tarrow 
(1981), Burstein (1985), Horn & Tilly (1986), and Olzak & West (1987). 
Compendia of events in various countries include those by Botz (1976), 
Kriesi (1981), Sugimoto (1981), and Horn & Tilly (1986). 

Although studies differ in defining event-forms, duration, violence, size, 
and outcomes, two main differences in strategy deserve note. First, Horn & 
Tilly (1986) and Franzosi (in press) developed computer programs that anal- 
yze the content of newspaper data on events. These programs allow research- 
ers to apply methods of content analysis to original newspaper accounts of 
strikes, revolutions, and racial strife using consistent rules. There are trade- 
offs involved in using open-ended programs, however. While they allow 
analysis of actual word counts, nuances, and perhaps also biases: of particular 
reporters or specific newspapers (Franzosi 1987), such programs require a 
large investment in coding time and.computer space. Moreover, using content 
analysis means that a second stage of distilling word counts and evaluation is 
necessary before beginning event analysis. Using shorter and simpler “coding 
sheets” that record data in a fixed format for each event generates easily 
quantifiable and tractable data sets that invite replication and facilitate easy 
comparison. 

A second difference concerns reliance on a newspaper’s annual index or 
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daily accounts. Most studies of race riots and civil rights’ activity used only 
an annual index, usually from the NYT (Spilerman 1970, McAdam 1982, 
Burstein 1985, Jenkins & Eckert 1986). This approach has several draw- 
backs. Even a quick perusal of the NYT index shows that an event can be 
easily confused with a “report.” For example, in practice it is hard to 
distinguish the NYT index entry of “Miami riots” from these alternative 
“events” (a) editorial comments on one or more riots, (b) subsequent stages of 
the same riot, (c) political speeches on a previous riot, or (d) follow-up reports 
such as arrests, court trials, demonstrations following arrests (Olzak & West 
1987). Moreover, the actual timing of an event usually cannot be inferred 
from the index, which is a crucial defect when using event-history analysis. 
Finally, reading and coding a full account of a set of similar events is 
necessary for learning how or even if events are linked. 

The alternative approach uses an index only as a guide to the daily 
accounts, suggesting candidate events and periods of unrest. Master lists from 
an index are sorted by date, location, and event-type, to eliminate 
redundancies.* Further improvements in coding reliability occur when both 
the index and dailies are consulted and cross-checked by different coders 
(Tilly et al 1975, Franzosi 1987, Olzak & West 1987). 


VARIETIES OF EVENT ANALYSIS: RESEARCH 
DESIGNS 


Four research designs have dominated studies of counts in collective events: 


a. Cross-sectional designs record occurrences of events in one period for 
multiple units, as in Paige’s (1975) study of peasant rebellions in agricul- 
tural export sectors; 

b. Time series designs record occurrences of events by period for a single 
unit, as in Snyder & Tilly’s (1972) analysis of collective violence in 
France from:1830 to 1930; 

c. Panel designs (or time series/cross-section designs) record occurrences for 
multiple units in two or more periods, as Spilerman’s analysis (1976) of 
race riots among cities and over time; .and 

d. Event-history designs record the exact timing and sequence of events, as in 
McCarthy et al’s study (1988) of founding rates of anti-drunk driving 
organizations among Cities. 


This is a problem since the NYT index lists events under multiple headings, but several riots 
can occur on one day in a city. Using the index makes it impossible to distinguish whether two 
reports that have the same date, location, and event type refer to the same or to different events. 
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A central research question in event analysis is how much to aggregate data 
on events over time and place. Table 1 classifies the four designs according to 
degree of aggregation by time and place. Cross-sectional designs aggregate 
completely over time. Analyses using this design commonly aggregate in- 
formation on the timing of events and construct a single event-count for each 
locale often for a long period (Paige 1975, Jenkins 1982). In some cases the 
decision to aggregate counts over time reflects the absence of time series data 
on independent variables. Time-series designs record the total number of 
events over all localities or SMOs for each period (so that there is temporal 
aggregation within periods). Panel designs involve less aggregation because 
they record event counts for each unit for each period. Event-history designs 
involve the least aggregation: they retain exact information on timing for each 
locale. Readers should note that this classification into four categories is a 
heuristic device; these dimensions are really continua. l 

Analyses of data on collective events varies along another dimension: the 


TABLE 1 Classification of approaches used in event analysis 


Data Structure 


Cross section: complete 
temporal aggregation 


Single time series: com- 


plete spatial aggregation 


Time series/cross section: 
partial aggregation 


Event History: exact timing 





“Logit regression can also be considered as process oriented. See text. 


Nonprocess Orientation 


Classical regression 
(Paige 1975) 

Logit regression? 
(Markoff 1985; McAdam 
1986) 

Time series regression 
(Snyder 1975, Jenkins & 
Eckert 1986) 


Two wave panel with 
lagged dependent vari- 
ables used as a control 
(Muller 1985, Weede 
1987) 


Process Orientation 


Poisson modeling 
(Spilerman 1970; Gold- 
stone 1980) 


Time-series regression with 
lagged dependent vari- 
ables ` 
(Hibbs 1976, Kelly & 
Issac 1984) 

Poisson regression 
(Hannan & Freeman in 
press) 

Panel analysis used to es- 
timate dynamic process 
(Spilerman 1976; Olzak 
1982) l 


Single point process 
(Olzak 1987) 

Models of transition rates: 
multiple units 
(Hannan & Carroll 1981; 
McCarthy et al. 1988; 
Liebman et al. 1988) 
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extent to which event analysis takes a process orientation. A process-oriented 
approach evaluates how time and covariates (including the.occurrence of past 
events) affect the timing and sequence of repeatable events, and it attempts to 
explain how events unfold over time (dynamics). In contrast, other ap- 
proaches, called “nonprocess” here for lack of a better label, summarize static 
relationships between levels or characteristics of units and event counts. 
Nonprocess approaches thus relate levels of unit-characteristics (aggregated 
with respect to time or place) to some unit-specific or period-specific event 
count. As in the case of the aggregation, this dimension too is a continuum.* 

When past applications are considered, the process dimension can also be 
summarized with respect to a choice of a probability model. This choice has 
substantive consequences as well. Nonprocess approaches have commonly 
employed classical regression models which assume a normal distribution, 
while process models have employed discrete-state stochastic process models. 
Most research on events uses the former. However, data on event counts have 
two characteristics that make the assumption of normality suspect: event 
counts are non-negative and discrete. Estimation techniques that assume a 
normal distribution have been used more as a convenience than because they 
were appropriate.” Now that software for estimating discrete-state stochastic 
process models is widely available, reliance on techniques based on normal 
distribution theory is likely to diminish. 


Cross-Sectional Analysis 


A variety of studies of collective action use methods that assume a normal 
(Gaussian) distribution to estimate the effects of community characteristics on 
variations in numbers of events. In cross-sectional applications this involves 
relating constant community characteristics to frequencies of events, aggre- 
gated over time to produce a total count for each community. This approach 
has been used in studies of strikes (Britt & Galle 1974, Lincoln 1978), 
lynchings (Inverarity 1976, Tolnay et al 1989), peasant rebellions (Chirot & 
Ragin 1975), organizational success (Steedly & Foley 1979), and urban 
protests (Eisinger 1973). Techniques used to assess the effects of covariates in 
these studies include correlation analysis (Jenkins & Perrow 1977), regression 


“Specifically, time series and panel approaches that estimate dynamic models (models with 
lagged dependent variables) can be given a process interpretation (see Table 1 for examples). 

“While counted data never strictly satisfy the assumptions of non-negativity and continuity, 
this does not make much practical difference when counts are large, as in the case of number of 
strikes per year in the United States. In practice, violation of this assumption of non-negativity 
has been circumvented by logging the counts of events or by adding 1.0 to the counts (see Snyder 
1975). For most studies reviewed here this has not been done in either time series or regression 
analyses. 
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analysis (Jenkins 1982), multiple classification analysis (Corzine et al 1983), 
and confirmatory factor analysis (Inverarity 1976). 

Studies of participation in social movements ‘generally also use cross- 
sectional data. As mentioned above, analysis of surveys of only participants 
in efforts to learn the causes of participation produces sample-selection bias. 
At least five studies avoided this problem by using data on samples of 
individuals who did and did not participate in riots (Paige 1971, Morgan & 
Clark .1973, Ladner et al 1981), anti-busing protests (Useem 1980), and 
vandalism during riots (Berk & Aldrich 1972). 

Some investigators have collapsed information on event counts into binary 
distinctions between “no event” and “one or more events” (Markoff 1986) or 
“participant” and “non-participant” (McAdam 1986). These analysts used 
logit regression which is more efficient than standard linear regression es- 
timates.° The gain in efficiency is important when the fraction of units with an 
event is small in number compared to the period of time at risk or the number 
of units without an event. . 

Used with care, cross-sectional designs maximize variability in structural 
properties that affect the occurrence of events. However, there are drawbacks 
to applying cross-sectional designs. to data on events that occur over time. 
Specifically, this approach has largely ignored the process by which events 
unfold, that is, whatever affects the nature, timing, and sequence of subse- 
quent events. A concrete research question illustrates this point. Does use of 
violence by the police early-in a riot affect the likelihood of more violence on 
the part of demonstrators? Such questions direct attention to sequences or 
stages in an event as well as to the timing of subsequent events, and this 
requires models and estimation techniques with a process orientation. 

But cross-sectional data can be used to estimate parameters of some simple 
process models. In an important example, Poisson models have been used to 
analyze cross-sectional data on the propensity of cities to riot (Lieberson & 
Silverman 1965, Spilerman 1970). Such analyses were motivated by the 
question asked in cross-sectional regression analyses: Do cities (or SMOs) 
differ with respect to some outcome? In these cases the answer was that 
heterogeneity affected the likelihood of an event. For instance, patterns of 
occurrences of race riots and organizational success were found to depart from 
a model of a constant rate of occurrence. Spilerman’s (1970) findings ran 
counter to the conventional wisdom about the relationship between poverty 
and the occurrence of riots (e.g. US National Advisory Commission on Civil 


°As in the cases of time series and panel analysis discussed below, logit regression can be used 
as either a non-process or process approach. For example, Tuma & Hannan (1984: 298) point out 
that a logit model could. be considered as an equilibrium implication in particular. However, 
research reviewed in this section has not considered logit regression in this way. 
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Disorders 1968). By using these more appropriate methods to analyze both 
riot and non-riot cities, he found that “racial violence is more likely where 
Negroes are better situated in occupational status, in education and income, 
and where the rate of population growth is small” (Spilerman 1970: 645). 


Single Time Series 


The time series design highlights patterns of changes over historical time. 
When theoretical or historical reasons suggest that structural conditions differ 
among periods, analysts can estimate shifts in parameters at predetermined 
times. Time series analysts have captured historical variation in three ways. 
One estimates a time trend (Snyder & Tilly 1972). A second estimates models 
separately by period to check whether causal structures differ by historical 
period (Wallace et al 1988). A third approach combines the two; it estimates 
an interaction between covariates and either a time trend or a set of periods 
(Snyder 1975). 

Time series analysis has been common in research on collective events in 
diverse settings and historical periods: French collective violence, 1830-1960 
(Rule & Tilly 1972, Snyder & Tilly 1972, Lodhi & Tilly 1973, Tilly 1986); 
American lynchings of blacks (McAdam 1982), American urban racial vio- 
lence, 1948—1979 (Kelly & Isaac 1984), American black social movement 
activity, 1950-1980 (McAdam 1982, Jenkins & Eckert 1986), and American 
domestic violence of all forms, 1890-1970 (Rasler 1986). 

The vast literature on strike frequencies contains numerous examples of 
time series analysis of event counts (Griffin 1939, Ashenfelter & Johnson 
1969, Hibbs 1976, Edwards 1981, Rubin 1986). Analyses of strike frequen- 
cies in the United States and elsewhere typically estimate effects of measures 
of economic well-being, political climate, and institutionalization of the labor 
movement or its organizations (Shorter & Tilly 1974, Snyder 1975, Wallace 
et al 1988). 

In principle time series analysis can be used to analyze processes that 
underlie the flow of events. For example, Hamblin et al (1973) use time series 
analysis to analyze processes of diffusion or contagion. Tuma & Hannan 
(1984: Part III) show that time series methods can be used to analyze process 
models for quantitative outcomes (such as large counts). However, the re- 
search reviewed here has rarely used time series in this way. Instead much 
time series analysis has treated time series data very much like cross-sectional 
data, simply relating counts per period to levels of (or changes in levels of) 
covariates in each period. 

However, when analysts include lagged values of the dependent variable as 
a regressor, the model is implicitly dynamic. That is, such models imply a 
time path of change in the dependent variable even if the regressors do not 
change over time. When analysts develop the dynamic implications of such 
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models—as in Hibbs’ (1976) use of adaptive expectation models to explain 
temporal variation in strikes—time series analysis can be considered an 
instance of a process orientation. 

Time series data can also be used to estimate parameters of discrete-state, 
continuous-time stochastic process models such as the Poisson. Poisson 
regression has been used to estimate the effects of social and economic 
conditions on the rate of occurrence of events using yearly counts in a single 
time series. For example, Hannan and Freeman (1989) used Poisson regres- 
sions (as well as event-history methods) to relate the number of labor unions 
to founding rates of unions. This research shows that the dynamics of social 
movements can be understood by focusing on (a) the internal histories of 
SMOs and (b) dynamics of the interdependent events that make up the cycles 
of social movements. 


Cross-Sectional and Time Series Analysis 


Use of both cross-sectional and longitudinal variation (panel waives in 
events represents a third approach (see Table 1). When information on events 
is available for two or more time periods on multiple behavioral units (cities, 
organizations, participants, etc), several estimation procedures can exploit 
more available information than does either cross-sectional or time series 
analysis alone. For this reason, panel analysis is particularly advantageous 
when analyzing rare events, such as revolutions. 

Panel designs have been used frequently in cross-national analyses of 
political instability and collective violence. Two common methods of estima- 
tion are: (a) multiple regression analysis for two-wave panel data and (b) 
pooled cross-section and time series estimators that adjust for autocorrelation 
of disturbances. The number of time intervals constrains the choice between 
the two because the second approach requires data for three or more periods. 
Two-wave panel analysis has been used in studies of income inequality and 
political violence (Muller 1985, Weede 1987). Multiwave panel analyses 
have been used to analyze severity of race riots in cities (Spilerman 1976) and 
the intensity of separatist violence and demonstrations in Quebec (Olzak 
1982). The latter two studies tested hypotheses that past occurrences of events 
as well as constant structural and demographic characteristics of cities or 
states affect rates of unrest. 

Almost all applications of panel analysis to events and social movements 
include lagged values of the dependent variables as regressors. For example, 
studies of political violence in a period usually include as a regressor the level 
of political violence in some earlier period. Often the inclusion of lagged 
dependent variables is motivated as a “control” variable in the spirit of 
Lazarsfeld. The estimated effects of such variables are then rarely given a 
substantive interpretation, and if one is given it rarely involves process or 
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dynamics (such applications are therefore listed in the left column of Table 1). 
Other analysts begin with process models and interpret ‘the effects of the 
lagged dependent variable as revealing something about the underlying 
dynamic process (and these are listed in the right-hand column of Table 1). 


Event-H istory Analysis 


Event-history analysis has been used recently to study rates of occurrence of 
collective actions, such as riots and protests, as well as the establishment and 
disbanding of SMOs. This approach involves application and generalization 
of methods originally designed, for studying events in biostatistics and in- 
dustrial engineering. This class of methods uses information on the timing and 
sequencing of events to estimate models of transition rates. Two general 
forms of event-history analysis are relevant to the study of collective action 
and social movements. The first involves study of recurrent events, such as 
riots, where the typical duration of an event is small relative to the waiting 
time between events. In this case the attention focuses on the rate of occur- 
rence, that is the transition from an event count of N to N + J (Coleman 1981, 
Amburgey & Carroll 1984, Amburgey 1986, Hannan & Freeman 1987). The 
second considers transitions between enduring states, such as different forms 
of political regimes (Hannan & Carroll 1981). The second approach can be 
used for studying transitions among phases in the histories of demonstrations, ` 
revolutions, and social movements (Tuma & Hannan 1984). 

Event-history analysis is particularly useful for analyzing cycles or waves 
of protest and violence. As noted above, while the concept of cycles of social 
movement activity is not new, it has rarely been investigated empirically. Use 
of event-history analysis facilitates study of diffusion of events. Second, 
questions about whether contagion is restricted to particular forms or stages in 
collective protest can be investigated with event-history analysis as well. For 
example, questions about whether Civil Rights’ protests spark subsequent 
racial conflicts, or vice-versa, can be investigated with analyses that estimate 
the transition rates among multiple states. | 

Like panel analysis, event-history analysis can use information on the 
timing of events in a series of localities (or SMOs) to estimate models that 
take both unit-specific characteristics and the timing of events. In contrast to 
the single point process models described above, examples of this combined 
cross-sectional and longitudinal design commonly analyze events in a number 
of cities, SMSAs, or some other subunit over a number of observation points. 
This approach combines the advantages of using both cross-sectional and 
temporal variation with the appropriateness of stochastic models that analyze 
events as outcomes of processes that unfold over. time. 

Event-history analysis uses more information about the transitions or se- 
quences in collective action than do other methods. Consider the case of 
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causes of national coups d’état and revolutions, which have long interested 
social scientists (Feierabend et al 1972). Hannan & Carroll (1981) compared 
panel analysis and event-history analysis in changes of national political 
structure. The panel analysis relates regime type (no party, military, one- 
party, and multiparty) at one time to regime type 25 years before in order to 
infer change. Event-history analysis was used to consider all transitions 
among the four .regime-types. The two kinds of analysis come to different 
conclusions. So event-history analysis adds new evidence to the debate over 
causes of political upheaval. 

Event-history methods have been applied to studies of founding and mortal- 
ity rates of SMOs in communities. For example, McCarthy and associates 
(1988) studied founding rates of anti-drunk driving organizations in counties 
and learned that community characteristics such as rural/urban character and 
population available for recruitment affected the founding rate of these 
SMOs, as did grievance-driven characteristics often cited as the impetus for 
organizing social movements. Others have used these methods to study 
processes of failure or dissension in social movements. For instance, Carroll 
& Huo (1988) found that success of local Knights of Labor organizations in 
city elections increased rates of disbanding of these locals. Liebman et al 
(1988) analyzed schisms in American Protestant denominations, using data on 
membership, organizational structure, and histories of splits within de- 
nominations. The line of research on the internal histories (including creations 
and disbandings) of SMOs provides new and highly flexible methods for 
studying various outcomes for such organizations. 


FUTURE CHALLENGES TO EVENT ANALYSIS 


As event analysis increasingly uses data from historical archives and news- 
paper records, the methods of historians and sociologists converge. This trend 
poses several challenges to event analysts. One challenge is to develop 
sociological approaches to exploiting these historical materials that are sensi- 
tive to the historical context, as Tilly (1981) notes vigorously. For example, 
recent studies by Tilly (1986), Markoff (1986), and Goldstone (1986) contain 
imaginative operationalizations and tests of historical explanations of con- 
tentious actions. The next task is to link historical arguments to existing 
sociological theories or to develop new theories to fit them, and to clarify 
historical arguments using sociological methods. 

Another challenge involves improving methods for gathering data on 
events. An abundance of coding manuals and appendices attest to the efforts 
applied to improving data collection, but surprisingly few studies report 
attempts to use alternative sources and ways to validate the measures. If the 
goal of event analysis is to provide better understanding of how collective 
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actions unfold, then such challenges should be met. Based on the success and 
diversity that has characterized event analysis thus far, it makes sense to 
encourage progress along these frontiers as well. 
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Abstract 


This paper undertakes a selective review of theoretical and empirical studies 
of internal labor markets (LMs). Three different conceptualizations of ILMs 
are identified in existing literature: (a) ILMs as all jobs within a firm; (b) as a 
variable describing firms or present in discrete clusters of jobs within firms; 
(c) and a phenomenon present in some occupational labor markets within and 
across firms. Some empirical research using each conceptualization is de- 
scribed. Evidence apropos several arguments about the theoretical origins of 
ILMs is assessed and some topics of future research are indicated. This review 
documents the absence of consensus about the characteristics defining the 
ILM concept; a resulting diversity of approaches to the measurement or 
location of ILMs further’ hinders both accumulation of findings and the 
comparative study of ILMs. The most promising of the theoretical origins of 
ILMs relates scarcities of highly skilled workers to job structures generating 
increasing skill and knowledge among workers. 


INTRODUCTION 


About ten years ago, a new literature on the sociology of labor markets 
emerged (Kalleberg & Sørensen 1979) as part of the “new structuralism” with 
its interest in dual economy theory and dual labor markets (Hodson & 
Kaufman 1982, Baron 1984). Within this literature, the concept of an internal 
labor market (ILM) has attracted the research interests of many sociologists 
(Kalleberg 1988). 

An intellectual debt to earlier work by economists is undeniable: for the 
ILM concept itself (Kerr 1954) and for such basic references as Doeringer & 
Piore (1971) and Thurow (1975). While work by economists on this topic 
goes forward (Osterman 1984a, 1987; Williamson 1985), the research of 
sociologists reflects a special concern with the institutional, organizational, 
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and occupational settings of labor markets including ILMs. This paper offers 
an assessment of our present knowledge of ILMs based on studies of ILMs in 
the United States and in other countries.’ 

Several questions will be addressed. First, how have ILMs been con- 
ceptualized and measured or located, and what findings aptly illustrate the use 
of each conceptualization? Second, what have we learned about the viability 
of several proposed theoretical origins of ILMs? Finally, in what directions 
does further research seem needed or promising? 

To anticipate some of the findings of this review, I show that no consensus 
exists as to the defining characteristics of the ILM concept and that, not 
surprisingly, an incredible variety of measures or indicators of ILMs has 
resulted. Among the theoretical arguments as to the origin of ILMs reviewed, 
the one relating the scarcity of skill to the structured usage and increase of 
employee’s skill and knowledge within ILMs seems the most promising. In 
view of the types of extraordinary data sets currently under analysis by Baron, - 
Mittman, Davis-Blake, Peterson, Spilerman, and others, and an enticing 
cornucopia of both quantitative and qualitative, historical, and theoretically 
developed works reviewed below, students of labor markets have much to 
look forward to in the next ten years. 


CONCEPTUALIZATIONS AND MEASUREMENTS 
OF ILMS 


There are several distinct conceptualizations of ILMs to be found in the labor 
market literature: 


ILM. Is All Jobs in a Firm 


In the first conceptualization, all of the jobs within an establishment or firm, 
viewed as a relatively homogeneous whole, are considered an internal labor 
market. To recall Osterman (1984a:166): 


Students of ILMs generally take the entire firm as their unit of analysis. The implicit 
assumption is that firms are unitary in their central employment characteristics and hence it 
is appropriate to speak of the ILM of a given enterprise. 


In many of the works in this group, an ILM is conceptually anchored in 
administrative rules governing hiring, promotion, layoffs, and the pricing of 


‘In view of space limitations, my reviews of literature are selective in the following ways. I 
use the distinction between the broader concept of primary labor markets and the narrower 
concept of ILMs, and I neglect works that explicitly focus only on the former. I also neglect 
works that may use the phrase “internal labor market” in their titles but are primarily about other 
topics, e.g., organizational demographic structure, or wage determination within organizations. I 
do not consider some works that may well be about ILMs but that do not make use of the ILM 
concept, ‘or. works that may indirectly address theoretical features of ILMs such as those on 
implicit contracts, efficiency wages, or rank-order tournaments. 
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labor (Doeringer & Piore 1971:4—5; Burawoy 1979; Gordon et al 1982; 
Granovetter 1986; Spilerman 1986). Other works equate ILMs with bureau- 
cratic labor markets, bureaucratic control systems, and with the operations of 
job classification systems (Grandjean 1981; Littler 1982:19; Jacoby 1984; 
Stark 1986:493 with respect to the United States ILMs; Cole 1979 with 
respect to Japanese ILMs; Streeck 1984, Gabriel & Holzapfl 1981, viewing 
West German firms; and Michon 1981, looking at French firms). 

On this conception, ILMs are not explicitly measured among the jobs or 
establishments of interest; rather a single ILM is assumed to be globally 
present within plant or establishment boundaries. This is related in part to the 
ubiquity of bureaucratic or administrative rules and in part to the absence of 
data on which to base ILM measures (see, for example, Gordon et al 1982: 
193). 

Because ILMs are more often assumed present than measured, some of the 
works placed in this first category do not typically yield “findings” about 
ILMs in the same sense as those in other categories. One exception is 
Felmlee’s (1982) unexpected discovery that within-firm job shifts exhibited 
positive duration dependence (shifts become more likely with growing tenure 
in a job), while shifts between employers were found less likely with growing 
job tenure, as in earlier works by Sørensen and Tuma (1981) and others. 


ILM Is Not All Jobs in a Firm 


ILM AS A VARIABLE In the second conceptualization, one or more ILMs are 
associated with some but not all jobs within establishments or firms. There are 
two approaches taken here: the first develops and analyzes explicit measures 
of the degree to which JLMs are present at the establishment level of analysis; 
at the job level of analysis, the’ conditional likelihood of jobs falling within 
ILMs is also analyzed. This approach requires uncommon and valuable data 
sets covering many establishments and the jobs within these establishments. 
In this category fall the work of Pfeffer & Cohen (1984), Baron et al (1986), 
Cohen & Pfeffer (1986) and Mittman (1986), among others. 

Working with a Bay Area sample of establishments, Pfeffer & Cohen 
(1984) created an establishment level measure of the presence of ILMs based 
on seven items related to promotion ladders within establishments.” Among 


Pfeffer & Cohen measured the presence of ILMs in establishments by giving one point each 
for the following items: (a) it followed an established policy of promotion from within; (b) 
promoted most employees with at least five years of service at least once; (c) filled all or almost 
all jobs from within; offered entry-level jobs with at least some promotion potential and requiring 
no prior work experience either to (d) those with no more than high school education or to (e) 
those with junior or four-year college degrees; and in manufacturing establishments only, (f) 
frequently promoted unskilled laborers to semiskilled jobs and (g) frequently promoted semi- 
skilled laborers to skilled jobs. The total score was the sum of these points divided by the number 
of items with no missing values. 
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nonmanufacturing organizations, the ILM measure was stronger, the larger 
the size of the establishment and the greater the degree of on-the-job (OJT) 
training. It was stronger for establishments that were located in the core 
economy and that had a personnel department. The ILM measure was weaker, 
on the other hand, the greater the degree of unionization. The degree of 
vestibule training and out-service training showed no effects on the measure. 
Among manufacturing organizations, the only significant variable was union- 
ization; this remained negative and was even more strongly related than for 
nonmanufacturing establishments. 

Using a subsample of 100 California establishments, Baron et al (1986) 
analyzed determinants of ILMs measured at both the establishment and job 
level.’ The results for both levels together indicate that [LMs were present to 
a greater degree, promotion ladders were-more likely, or a job was more 
likely on a promotion ladder, when the establishment was large or when the 
establishment was linked to a larger organization. Promotion ladders were 
more likely, or a job was more likely to be on a ladder, as the degree of 
firm-specific training increased. 

However, professional jobs, union jobs with greater firm-specific training, 
jobs held exclusively by women, and jobs dealing with more complex levels 
of data were less likely to be on promotion ladders. Among all jobs on 
promotion ladders, moreover, jobs less likely to be entered from the outside 
included managerial jobs, union male jobs, and jobs with more firm-specific 
training. Jobs with greater prospects of further upward mobility on ladders 
included professional or technical jobs, jobs with greater numbers of in- 
cumbents, and jobs with less firm-specific training. 


ILMS AS DISCRETE SUBSETS OF JOBS WITHIN FIRMS In the second 
approach to the conceptualization that not all jobs within a firm fall into 
ILMs, ILMs are associated with discrete subsets or clusters of jobs (or of 
workers) within establishments or firms. The difference between works in this 
and the first approach is that here some discrete groups of jobs within 
organizations or industries are studied, e.g. sales or clerical employees, 
commercial lenders, crane operators. Examples are found in job ladders 
within the steel industry, historically (Elbaum 1984), job-rotation systems in 
large Japanese firms (Koike 1983, 1984), the careers of managers in high 
technology firms (Kanter 1984), job ladders for associate members of large 
law firms (Wholey 1985), job ladders among laboratory workers in chemical/ 
oil firms (Schroeder & Finlay 1986), both informal groups and legalized 


3Two measures at the establishment level were used: the first was based on six characteristics 
of an establishment’s promotion system; the second was a dummy variable whose score reflected 
whether or not the organization had jobs on promotion ladders. At the job level, a dummy 
variable score distinguished between jobs on or off a promotion ladder. 
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subcontracting units within (Hungarian) “socialist ILMs” (Stark 1986) and 
labor markets within McDonald’s (Parcel & Sickmeier 1988). 

In the more quantitative: works using this conceptualization, statistical 
indicators are used, more to locate and characterize distinct groups of jobs as 
part of one or more types of ILMs than to measure the degree of an ILM 
presence. Examples include a work which uses transition matrixes of mobility 
patterns to describe ILM structures (“occupational career ladders”) within 
clerical, sales, and manager-professional-technical jobs in a large insurance 
company (Mahoney & Milkovich 1971), and the finding of positive duration 
dependence in job shifts (White & Althauser 1984). Dalton & Snelling (1983) 
. detect the presence of ILMs in the significance of a regression coefficient 
representing statistical interaction between income predictors like having a 
doctorate and being in an older or more recent entry cohort of college faculty. 
Osterman’s treatment (1984b) of “‘white collar ILMs” uses percentages of 
firms having top or middle jobs in a job ladder commonly entered from 
outside of the firm or of employees entering from other ladders inside the 
firm. 

Historical treatments of the rise of ILMs, either in their own right or as part 
of other corporate and industrial change, are commonplace, including Ed- 
wards 1979, Gordon et al 1982, and several works in Osterman’s Internal 
Labor Markets volume: Jacoby 1984, Kochan & Cappelli 1984, and 
Elbaum’s (1984) piece on the iron and steel industry. While other accounts 
concentrate on the rise of personnel departments and the administration of 
bureaucratic rules as ILM equivalents, Elbaum focuses on the development of 
internal promotion practices going back to 1866. There, the highly skilled 
sought to preserve their contro] of internal promotion rights, while they 
opposed the lesser skilled heipers who demanded access to their jobs. The 
craft union subsequently formed of puddlers and other skilled groups contin- 
ued the battle against the helpers into the early 1870s. In another decade this 
union had absorbed the helpers (responding to competition from the Knights 
of Labor) and now supported internal promotion (by seniority). These union- 
supported internal promotion practices outlasted the union’s defeat in the 
Homestead lockout of 1892 and were in essence continued by management. 
Promotion-based ILMs provided bargaining leverage over wages. Thus pro- 
tected from competitive market conditions, wide occupational wage differen- 
tials across diverse departmental job ladders resulted, following patterns 
Elbaum finds contrary to both radical and neoclassical theory. 

Carter & Carter (1985) identify three discrete ILMs among six subgroups of 
male and female, sales and office clerical workers in two department stores, 
Hearn’s and Lord & Taylor’s (circa 1913). The first—the female sales 
workers at Hearn’s—is theoretically interesting because none of the con- 
ditions for the formation of ILMs which the authors have reviewed are 
present: no firm-specific skills, labor unrest, OJT by other workers, or 
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deskilling. These women lack power, earn low wages, and yet are promoted 
within Hearn’s, suggesting that this ILM evolved out of promises of “regular 
future promotions as a substitute for payment of market-level wages” (pg. 
593). : 

The second ILM consists of female clericals at Hearn’s, whose work is 
somewhat rationalized, routinized, and deskilled. Yet rigid internal promo- 
tion policies are observed. Still missing as originating conditions are firm- 
specific ‘skills and labor unrest. In the third ILM found among the male 
clericals at Lord & Taylor’s (the high-wage employer of the two), less 
rationalized and more firm-specific idiosyncratic accounting procedures are 
used. This fact generates an unusually large number of skilled, male clerks 
who provide OJT to other, Junior, male clerks, and who are supported by 
less-skilled women workers. This ILM is different in one other way as well: 
Lord & Taylor’s hires some outside recruits away from the best of the local, 
experienced clericals in other firms (as it did from the best of Hearn’s sales 
personnel) even while using its ILM to meet the rest of its need for skill. 


Occupational Internal Labor Markets (OILMs) 
In this conceptualization, as in part in the one above, the ILMs are associated 
with discrete subsets or clusters of jobs. What is distinctive here is that these 
are found either within occupational categories or across related occupational 
categories; and they are found both within and across establishments or firms. 
Because of the strong association of the ILM concept with jobs within firms, 
this is the least explored of all ILM conceptions. The recent ancestry of this 
category of ILM can be found in Doeringer & Piore’s often neglected 
treatment of ‘craft or occupational’ types of ILMs and in Althauser & 
Kalleberg’s (1981) conception of occupational ILMs (OILMs). Works 
illustrating this conception include Smith’s (1983) study of job shifts among 
basketball and football coaches at various levels of college athletics. 
Within manufacturing at the establishment level of analysis, Baron et al 
(1986, p. 261) found that managerial, professional, and technical jobs are 
more likely to span establishments than to be confined to one. Later in the 
same work, they find that the more technically complex jobs are just signifi- 
cantly more likely to be on job ladders; even more interesting, they are easy to 
enter from the outside. The authors remark that: 


Many of the jobs were in ladders used as a training ground for the pertinent skills. 
However, due to the technical nature of the jobs, establishments often also relied on 
external sources of workers, either to deal with increased demand, to take advantage of 
favorable labor market conditions, or to offset difficulties associated with long internal 
training times. (Baron et al 1986, pp. 266) 


Another example with OILM features is the Lord & Taylor’s male clerical 
group described in Carter & Carter (1985); clerical skills were both internally 
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developed by more experienced employees and supplemented by. the external 
pirating of people with comparable skills. Dale’s (1987) study of job and 
employer changes across occupational groups using a large Labour Force 
Survey of employed civilians in the United Kingdom utilizes both this and the 
second (B2) conceptualization. 

Reminiscent of Elbaum’s account, Finlay (1983) describes a 1978 labor 
contract that has left the occupation of West Coast crane operators socially but 
not technically differentiated into two distinct post-contract labor markets: 
higher skilled, certified men (steadily employed by a single employer or from 
a union hall), and the lesser skilled noncertified men. This result is a para- 
doxical product of a difficult choice of opposing goals within the union and a 
bargained compromise between the employers’ association and the union. 
The employers have acquired the union’s assent to limiting on-going access to 
the ranks of the certified crane operators. In.so doing the unions have shut 
down employer-driven ILMs which formerly trained noncertified workers to 
become certified, thereby reducing the competitive threat to the certified 
operators. Employers may no longer hire new, noncertified operators along 
with certified operators; but they continue the past practice of lowering their 
overall wage bill by using now vulnerable, noncertified operators hired before 
the 1978 contract. These men may continue to work steadily as long as they 
do not leave their 1978 employers. So a residual pre-contract dual market also 
remains; and at the same time, the closure of access to the ranks of the 
certified has engendered pirating of men who are lured into changing employ- 
ers. 

Among economists, Dalton & Snelling’s (1983) mixture of closed and open 
ILMs characteristic of academic labor markets, Ryan’s (1984) discussion of 
shipyard welders as an example of “costly transferable skills,” and Oster- 
man’s (1987) craft subsystem ILMs illustrate this conceptualization. In a 
study based on interviews with managers of twelve white-collar Boston area 
firms, Osterman (1984b) illustrates the existence of three discrete subsystems 
or ILMs within firms (industrial, craft, and secondary) by contrasting the 
types of openness of ladders reported for sales, managerial, computer, and 
clerical jobs. Thus, industrial ILMs (managers) show little entry from the 
outside, and access from other types of jobs within the company is confined to 
low “entry” jobs. His craft ILMs (computer programmers) show no entry 
from within the firm above the entry job, but easy access to middle and top 
jobs in the ladder from outside the firm is apparent. Secondary ILMs (cleri- 
cals) enter all levels of the ladder from outside the firm and from other ladders 
in the firm. . 

Osterman views the association of particular ILMs with particular occupa- 
tions or jobs as a matter of an employer’s choice; and this choice is best 
understood by looking at instances and sources of changes in earlier associa- 
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tions. Hence a changing product market and regulatory environment changed 
the labor market for commercial bank lenders from an industrial to a craft 
type; yet, an older pattern of banks providing training persisted in the face of 
banks hiring away lenders from each other (1984b:179). Or (Osterman 
1984b:180-81) changes in labor market conditions (shortages, craft market 
driven wage increases) have led firms to attempt to supplant externally trained 
staff with employees given in-house, albeit skill-truncated, training (with 
computer programmers as an example). 


A CONCEPTUAL POTPOURRI? 


It would be hard to think of another concept that has been measured or located 
by means of so many different approaches! Whatever the substantive variation 
suggested by the four conceptualizations above, even more variation in 
defining elements becomes apparent if we recall some of the key theoretical 
characteristics associated with ILMs. For example, following either human 
capital (Becker 1964, 1985) or institutional theorists (Doeringer & Piore), 
some works concentrate their attention on the elements of firm-specific skills 
(FSS) and training (Ryan 1984, Carter & Carter 1985, Baron et al 1986, 
Pfeffer & Cohen 1984), on-the-job training (many of the same works just 
cited), job ladders and promotions (Carter & Carter 1985, Baron et al 1986, 
Althauser & Kalleberg 1981, 1989 in press), or on increments of skill and 
knowledge concurrent with job movement (White & Althauser 1984, Baron et 
al 1986). 

The qualitative contexts in which ILMs originate are also varied. Recall the 
IL.M—Hearn’s female sales workers—in Carter & Carter (1985) which had 
no origin other than to control workers with a finely graded job and pay 
ladder. Or contrast the two types of ILMs in Edwards (1979:18 1-83): the first 
is the product of a “compromise” model of ILMs, found amidst the “technical 
control” of “subordinate primary market” segments; the second (1979:176, 
182-83) is a bureaucratic ILM for professionals and managers born of “in- 
dependent primary control.” Or recall Finlay’s (1983:311—12) examination of 
employer-controlled ILM for the training of noncertified crane operators, 
relinquished in the 1978 contract between the union and the employers 
association. 

Perhaps the most challenging examples of the varied usage of the ILM 
concept comes in a cross-national context as, for example, in discussions of 
Japanese, US, and socialist (Hungarian) ILMs. In Koike’s account of a large 
Japanese steel company’s ILM, a system of on-the-job training and frequent- 
to-irregular occasions of job progression or rotation are scheduled by the 
workgroups themselves, and to a lesser extent by foreman and subforemen 
(1983:42-47). In Stark (1986), groups of engineers, technicians, skilled or 
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semi-skilled workers working for Hungarian firms bid for internal sub- 
contracts from employers, to be completed after hours within the employers’ 
plants. 

The variation in the structural character of ILMs in these very different 
institutional or national contexts again reminds us of the usual problems in 
comparative research of achieving conceptual equivalence (Kohn 1987:720). 
The difficulties of exporting concepts are legion when, to begin with, there is 
fair consensus on the meaning of the concept in at least one culture (e.g. the 
different meanings of being a worker or manager, as noted by Maurice et al 
1984:249). Here the comparative task is further complicated by the absence of 
such consensus. i 

Some greater conceptual order might seem possible through the differentia- 
tion of multiple types of ILMs, including different types of ILMs within firms 
(FILMs) as well as occupational (as distinct from firm) ILMs. However, for 
this avenue of. clarification-to be useful, some theory-driven adjudication of 
alternative typologies is needed in the long run. Some progress in addressing 
the theoretical origins of ILMs would be useful in this regard. 


THEORETICAL ORIGINS OF ILMS 


One of the consequences of the conceptual diversity we have witnessed 
above is that the cumulation of knowledge about the theoretical origins of 
ILMs is more difficult than if a single conception held sway. Moreover, no 
correspondence has emerged between different types of ILMs however dis- 
tinguished (e.g. blue-collar or manufacturing, white collar, craft) and either 
distinctive origins of ILMs or factors reproducing already established ILMs. 
This discussion examines the current status of several distinctive accounts, 
assumptions, and factors related to the origins of ILMs. 

After some theoretical dissent (reviewed in Althauser & Kalleberg 
1981:137-40), and more recent theoretical and empirical work (Gordon et al 
1982:200-204; Pfeffer & Cohen 1984:567--68; Baron & Bielby 1984:466- 
68), the presumption (associated by many with dual economy theory) that 
ILMs are simply and invariably a derivative feature of core-economy organi- 
zation is becoming less widely held. The relationships found between markets 
and economic sectors, ranging from modest to nonsignificant, depending on 
the measures used, have never matched the strong expectations of this early 
theoretical presumption. 

Some historical accounts of ILM origins suggest that ILMs have emerged 
as part of struggles between production-line, union, or professional workers, 
and employers (Jacoby 1984, Elbaum 1984, Bills 1987:217). This has been 
supported by several accounts (Cole 1979:104, 111 and elsewhere; Gordon et 
al 1982:187f; Elbaum 1984; Jacoby 1984; Bills 1987:217). Labor market 
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arrangements in general and not ILMs in particular may well be products of 
such struggle as Finlay’s analysis of two labor markets within one occupation 
suggests. The principal significance of these accounts is that they make 
apparent some limitations of earlier radical and neoclassical treatments (see 
Elbaum 1984 as an example). What remains unclear about this theoretical 
account is its scope. Does it apply to white collar ILMs for example? 
Several historic accounts associate the spread of ILMs with the develop- 
ment of personnel departments, formalized rule structures and the rise of 
bureaucratic control (Jacoby 1984, Baron et al 1988). A subset of these link 
this spread to the deskilling of work, viewing deskilling as a cause of 
spreading ILMs or a companion of larger changes (Stone 1975:45-58, esp. 
55-58; Burawoy 1979:107-8). Yet studies viewing ILMs as embedded in 
policies of internal promotion and job ladder structures show that some ILMs 
preceded the onset of these bureaucratic control systems and rules (Doeringer 
& Piore 1971:54,n. 28; Jacoby 1984:55; Elbaum 1984; Carter & Carter 
1985). In some of these latter examples, union support was involved; in 
others, passing union opposition or the absence of a union role is apparent. 
Several accounts stress the role of selected features of the workplace as 
proximate causes of ILMs. At the outset of their classic work, Doeringer & 
Piore (1971) listed three causes of ILMs, two of which have attracted research 
interest: specificity of skill, and technology; and on-the-job training. 
Enterprise or firm-specific skills (FSS) has quite an intellectual pedigree as 
a logically attractive, theoretical origin of firm ILMs, ranging from human 
capital accounts (Becker 1964, 1985) to Oliver Williamson’s notions of task 
idiosyncracies and now asset specificity (Williamson 1981, 1985). The un- 
derlying concept is that of a bilateral monopoly: FSS make employees a 
valued asset not easily replaced from the outside; employees want to stick 
with the employer because their FSS lack value to other employers. 
Dissent from this view takes three forms. First, theoretical criticisms 
abound: Cole (1979:37-43); Goldberg (1980:263—5); Elbaum (1984:72); 
Osterman (1984b:174—76, 1987). Second, “no systematic and accepted mea- 
sure of specific training” exists (Cole 1979:43). If this argument has not 
deterred many recent attempts at such measures, some authors at least de- 
scribe their measures as imperfect or indirect (Baron et al 1986:255, using 
several types of training times; Pfeffer & Cohen 1984:559-60, using the 
presence of vestibule training and the amount of formal OJT in the absence of 
more direct information; Peterson 1988, using seniority). Third, for the most 
part, empirical results using measures of FSS are either mixed or simply 
inconsistent with this theoretical origin. In the mixed genre, Schroeder & 
Finlay (1986:830-33) compared two groups of laboratory workers— 
chemical/oil and hospital technicians—-with similar mixes of firm-specific 
and general skills, and they found that ILMs were present among the former 


INTERNAL LABOR MARKETS 153 


but absent from hospital settings. Carter & Carter (1985) linked only one of 
their three ILMs—the male office clericals at Lord & Taylor’s—with firm- 
specific skills. 

More disturbing are complex patterns of findings simply inconsistent with 
this account.* The mixture of internal and external hiring observed for the one 
FSS-based male clerical ILM at Lord & Taylor’s is echoed by Baron et al’s 
(1986) finding that the only significant positive association of FSS with jobs 
above entry level is with jobs which can be entered from the outside. 
Manwaring finds (1984:168) that tacit skills associated with specific rolling 
mills engender an extended ILM reaching for mill recruits into the community 
of employee’s acquaintances. At first glance this looks like a case of firm- 
specific skills creating an ILM; yet this instance of FSS breeches the boundary 
of the firm! In pursuit of FSS among four white collar occupations, Osterman 
finds that skills were more transferable than firm-specific for all four. His 
conclusion—that skill specificity cannot explain differences in craft, in- 
dustrial, and secondary ILMs~—appears more generally warranted. 

On-the-job training (OJT) is another theoretical source of ILMs. OJT is 
expected to operate in a manner similar to skill specificity. Hence their 
separate measurement has at times proven difficult, as has a separate empiri- 
cal assessment of alleged effects (Pfeffer & Cohen 1984:554, 564). If we set 
aside Pfeffer & Cohen’s association of OJT with FSS, we can view their 
finding which associates OJT with internal labor markets as support for OJT 
as a source of ILMs. The same support is indirectly suggested by both 
qualitative information and the significant coefficients for weekly earnings 
regressed on occupational experience for two of Carter & Carter’s (1985) 
ILMs. 

On the other hand, relative to formal OJT, the cost saving character of 
informal OJT with its characteristic “osmosis” (Doeringer & Piore 1971:18- 
33) is crucial to theoretical arguments (often cited in Thurow 1975:81—85, but 
found as well in Doeringer & Piore) about how the security and noncompeti- 
tive character of ILM job ladders fosters the safe transmission of skills from 
more to less experienced workers. With one exception there has been no 
attempt I know of to capture empirically this feature of work as a progenitor of 
ILMs. The exception is in Baron et al’s (1986:250, 261) finding that “jobs 


“Baron et al (1986:263) offer a striking test of this notion at the level of jobs. While greater 
firm-specific skills (FSS).are associated with jobs on ladders, other findings show that FSS are 
more present in jobs above entry level and are negatively associated with jobs at the bottom of 
ladders. If we look at Pfeffer & Cohen’s (1984) two measures of FSS, vestibule training is never 
significantly associated with their ILM measure; the other measure of FSS, formal employer- 
supported OJT, is not significant for manufacturing, only for nonmanufacturing organizations. 
Such findings that manufacturing jobs and entry-level jobs are disassociated or not associated 
with ILMs are hardly consistent with numerous statements of the FSS origins of ILMs. 
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directly associated with process technology [where there is a premium on the 
transmission of skills across workers] tended to be in longer, relatively 
structured ladders. . . .” Yet “the overall ILM scores for some of these 
establishments were low.” 

Indeed, the theoretical status of both specificity and even more, of OJT, as 
progenitors of ILMs is doubtful even in Doeringer & Piore (1971:31-33). A 
causal model of their account would show both skill specificity and an 
anticipation of cost savings by employers and employees alike as causes of 
ILMs, which, with their wage rigidity and job security, then produce work- 
able on-the-job training. By Goldberg’s account (1980), even firm-specific 
skill itself is a product of pre-existing ILMs; Cole makes the same point about 
Japanese ILMs (1979:39-43). 

ILMs have been seen as products of employer's needs for renewable 
supplies of otherwise scarce, highly skilled workers (Doeringer & Piore 1971, 
Finlay 1983, Carter & Carter 1985). There is a hint of this notion in historical 
accounts of conditions from which ILMs ultimately developed (Littler on the 
US and Japan, 1982:153, 165, 176; Cole 1979:103; Sengenberger on West 
Germany, 1981:245). Empirical support for this notion can be found in 
several studies. 

Osterman studied changes in ILMs as a means of grasping the principles of 
their origins. Thus, for example, given conditions of a scarcity (versus a 
surplus) of secretaries, firms chose to create career ladders (1987:54—5). Two 
of Bills’ three ILMs (1987:218) similarly respond to skill shortages: the 
engineers at Exurb and the tool-and-die apprentice program; even the apparent 
exception, a clinical progression ladder among nurses, reflects a deliberate 
attempt to upgrade nursing skill levels in a loose labor market (1987:218). 
Stark’s socialist ILMs, the enterprise business work partnerships (VGMs), 
reflect chronic labor shortages (1986, p. 494). These partnerships largely 
consist of skilled workers in heavy industry (p. 497). Having acquired 
technical skill is one of the structural factors influencing the likelihood that 
workers enter such groups; their skills are valorized (p. 496) not during 
regular hours of work, but when the groups work after hours. While any 
development of skill within such groups is unclear, the ability of employers to 
retain and use skilled labor and the motivation of workers to develop skill (as 
a factor in group membership) appears well served by this variant of an ILM. 
In Koike’s large steel mill, workers’ skill levels are principally enhanced by 
means of regular or irregular job rotation. Blauner’s (1964:149-50) refinery 
workers moving up through a series of progressively more responsible and 
skillful positions can represent numerous other American examples of skill- 
engendering ILMs. 

More generally, Pfeffer & Cohen (1984:557) test the view that ILMs 


INTERNAL LABOR MARKETS 155 


are more likely to be found in establishments reportedly facing labor scarci- 
ties. No relationship was found, but an establishment level test alone does not 
constitute an adequate test of the notion that scarcities of skill at particular 
points in time engender ILM job ladders using such skills. Indeed, no 
cross-sectional analysis may be adequate for tests of this notion, insofar as 
existing ILMs may have emerged in response to earlier, perhaps now altered, 
factors or conditions—another of the many examples of Stinchcombe’s 
(1965) correlation of age and structure (cited or illustrated by Cole 1979:103, 
Elbaum 1984, Kanter 1984:129). . 

If ILMs are a response to scarcity of skills through internal OJT efforts, we 
would expect that (a) increments of skill and knowledge would accompany 
job changing in job lattices, and that (b) ILMs would dynamically link the 
timing of movement up job lattices with increasing skill and knowledge. 
Findings consistent with these expectations would constitute an indirect test of 
the idea that ILMs originated as suppliers of needed skilled personnel. As to 
the first expectation, Baron et al (1986:268-69) reported that 83% of the 288 
job ladders as described by employers and job analysts had the expected 
gradients of increasing skill and knowledge. As for the second, an indirect 
indication of this dynamic linkage is evident in findings of positive duration 
dependence of job durations prior to job shifts, as noted above. White & 
Althauser (1984) found this type of dependence for male (but not female) jobs 
in two banks. And Althauser (1989) found greater positive duration depen- 
dence among job shifts of males in branch banking along career lines versus 
those among other combinations of gender, department, and career or non- 
career lines. Using log-logistic estimates, Peterson & Spilerman (1988) found 
duration in salary grade had a positive (but then declining) curvilinear effect 
on the rate of getting promoted within a large insurance company. 

Indeed, gradients of skill and knowledge may be structured by job rotations 
within an ILM that do not at the same time constitute job shifts through 
hierarchically classified jobs (Koike 1983:44). Employers in large Japanese 
firms thus achieve a degree of flexibility in the assignment of workers to tasks 
that a recent loosening of tightly defined job classifications and job ladders 
has sought to achieve in the United States and elsewhere (Finlay 1988, 
Heckscher 1988). 

A variation on this thesis—that ILMs grow or become more active during 
periods of rapid organizational growth—deserves attention in its own right. 
According to Finlay (1988), it is during such periods that a shortage of skilled 
workers engenders greater usage of ILMs; conversely, seniority-based JLMs 
become unneeded “if skilled and qualified workers in local labor markets are 
available.” Pfeffer & Cohen (1984: 558, 564) recall similar arguments (in 
Goldberg 1980, for example) but fail to find any empirical effect of either 


156 ALTHAUSER 


growth or decline on the presence of ILMs. Additional work on this notion 
would ideally look into the effects of different amounts of growth (e.g. does 
very rapid growth exceed the internal promotional capacities of firms?) and 
would separate growth or decline from the scarcity of skilled workers as 
sources of ILMs. 

While accounts of occupational segmentation, occupational labor markets, 
and craft labor markets are available (Stolzenberg 1975, Jackson 1984), a 
theoretical accounting of occupational ILMs remains to be developed, where 
OILMs are viewed as distinct from occupational labor markets or from ILMs 
within firms (Althauser & Kalleberg 1981). Obviously some of the specific 
origins of ILMs generally would not make sense in the case of OILMs, e.g., 
firm-specific skill. And some of the origins of occupational markets must 
apply to OILMs, given the essential features they share—more general skills, 
occupation- or profession-specific knowledge, and the fact that occupational 
markets seem more under the control of their incumbents than their employ- 
ers. 

There is, I would argue, an essential ingredient tying together accounts of 
both FILMs and OILMs in particular, even amidst the obvious difference in 
the parties typically in control. To be sure, most nonradical accounts, one way 
or another, rely on idiosyncratic or firm-specific skills to explain ILMs 
(Goldman 1980). But note: the underlying feature of firm-specific skills is 
that of the scarcity of skill, i.e. by definition, firm-specific skills are not 
available from the external market (Doeringer & Piore 1971:30). Consistent 
with this account is the recent change of commercial loan officer job ladders 
into OILMs as a response to a shortage of skilled lenders and the increasing 
skill required, given banking deregulation (Osterman 1984b:179). 

Perhaps this scarcity has general effects on the beginnings of ILMs of either 
FILM or OILM type. How great a difference can we expect between an FILM 
resulting from workers’ collective pressures, for example (Elbaum 1984:101) - 
and a OILM resulting from the pressures of craft, technical, or professional 
workers? How this basic scarcity comes into existence—perhaps a product of 
changes in technology or the social characteristics of the labor force or of 
overriding social and political processes-——is beside the point here. 

The foremost challenge to this account is that of explaining the differential 
origins of OILMs as opposed to occupational labor markets or occupational 
segmentation. In other words, under what conditions do occupational markets 
have the added OILM feature that growing skill and knowledge is temporally 
linked with job shifts, both within and across employers? Perhaps this added 
feature is-a reflection of a greater degree of occupational or professional 
control (perhaps influence over the amount of skill scarcity) that is present in 
OILMs as compared with occupational labor markets. 
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FUTURE RESEARCH: PROMISING APPROACHES, 
ENTICING THEMES 


Additional research is necessary and looks promising in several areas of work. 
First, Spilerman (1986:45) raises an important issue about the degree of 
correspondence between job ladders as discerned by personnel department 
staff or employment analysts and actual employee career histories. Data even 
from single firms (if carefully chosen)’which would permit:a direct study of 
this correspondence would be enormously valuable, given the dependence of 
past research work on data of each sort. As part of some work with job and 
salary shifts among a large sample of California civil servants that is still in its 
early stages, Mittman (1986) found that between 10% and 50% of all job 
shifts took place off official career lines. 

Second, concentrating for a moment on the range of data ever used in labor 
market work, a promising new world of exploration has opened up by the use 
of establishment level data sets already analyzed in Pfeffer & Cohen (1984) 
and in the recent works of Baron, Bielby, and their collaborators. The 
presence of information on job ladders and establishments, across so many 
establishments, 1s the most extraordinary feature of the data in Baron et al 
(1986). Heretofore, and perhaps for some time to come, we have been forced 
to choose between collecting rare information from an intensively studied, 
individual firm (a large utility in Rosenbaum 1984) and multifirm samples 
with far less intensive sampling within each firm (Rees & Schultz 1970, with 
employees in 13 occupational titles sampled across 74 establishments). While 
additional data sets of these sorts would help us further explore this new 
world, I do not foresee much more of the same until organizational data 
collection achieves greater institutional support. | 

Third, the theory-driven qualities of the best empirical and historic work 
reviewed above suggest that the field will be able to make valuable use of an 
increasing number of detailed case studies of concrete FILMs and OILMs. 
Perhaps these will be along the lines of Spilerman’s (1986) recent explora- 
tions of “simple” and “unitary” structure, DiPrete’s recent work on jobs in 
governmental ILMs (1987), and Osterman’s strategy (1984b, 1987) of look- 
ing at labor markets which shift from one type of ILM to another. The data 
collected should be comparative and inclusive enough to permit the 
corroboration possible when both numerical data and qualitatively rich data— 
perhaps grounded in consulting experience, recalling Kanter’s work (1984) or 
participant observation (Burawoy 1979)—are collected from the same source 
or site. 

Fourth, looking next at substantive themes, there has been obvious progress 
in specifying extra-firm, economic, and environmental influences on the 
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formation and operation of ILMs. We need only consider the distance traveled 
from the ground-breaking earlier works, whose common departure was 
taxonomies of industrial segmentation, to such works as Baron & Bielby’s 
(1984) theoretical and empirical treatment of environmental dominance, in- 
ternal organizational complexity and labor market structures; Baron et al 
(1986) already discussed above; Kanter’s (1984:128) enumeration of four 
classes of social and environmental variables that shape organizational struc- 
ture and ILMs for managers; Osterman’s (1984b) suggestive examples of 
changing product markets and shortages of labor; and Elbaum’s (1984) 
careful historic analysis of the steel industry. 

Perhaps the most enticing substantive theme of all is the embeddedness 
theme that has found new expression in recent works firmly grounded in both 
on-going theoretical debate and in fieldwork: Granovetter’s earlier (1974) and 
more recent work (1986) and his (1985) critique of Williamson; and Bura- 
woy’s (1979) revelations of labor process games within a manufacturing ILM. 
It is delightfully apparent in Granovetter’s retrospective (1985:499-500) on 
that part of Dalton’s (1959) ethnography describing the social and political 
character of an intrafirm audit. The contrast in Schroeder & Finlay (1986) 
between the presence of ILMs among chemical/oil workers and their absence 
among comparable hospital laboratory technicians could also be taken to 
illustrate this theme. But no better example can be found than in Finlay’s 
piece on crane operators, when he. reflects on Becker’s “specific skills” 
concept and its technical and economic implications for the creation of ILMs: 


Their skills have afforded them many of the benefits of steady employment and yet the 
most important benefit—the ready transferability of these skills between companies—is 
denied them because of social arrangements within the industry. Technically, their skills 
remain general, but socially, they are now enterprise-specific . . . The problem with [the 
radical employer control theory or that of Doeringer and Piore 1971] is that it assumes that 
a social arrangement, such as a labor market, is determined by technical changes in skill. 
Instead . . . [they] are constructed independently of skill: longshore crane operators are in 
two labor markets despite possessing equally a single general skill (Finlay 1983:312-13). 


Or, finally, the theme is apparent in Burawoy’s account of how ambiguous 
and precarious are the social relations between trainer and trainee (1979:101— 
6). This helps balance the importance placed by institutional and neoclassical 
accounts on OJT for the operation of ILMs and on the non-competitive 
protection of such training activities. 

Fifth, work is needed to substantiate and understand the possible decline of 
certain features often associated with ILMs (Pfeffer & Baron 1988, Finlay’ 
1988). How extensive are such recent efforts to collapse or dismantle nar- 
rowly defined job titles and work rules, as evident in the auto industry (Buss 
1984)? Attention should be given to the implications of recent treatments of 
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industry and organizational history and of the emergence of multiskilled craft 
work and flexible-specialization found in Piore & Sabel (1984), not to 
mention the now-noticed persistence of peripheral small firms (Granovetter 
1984). Even with future studies of the decline of ILMs or of various character- 
istics associated with ILMs (e.g. narrowly defined job classifications or job 
ladders), I hope that greater agreement on what we mean by an ILM and 
additional theoretical understandings of their origin and operation will keep 
pace with new understandings of their morbidity! 

Finally, more comparative labor market studies are needed, especially if 
they are sensitive to larger theoretical issues associated with ILMs and to 
detailed differences in the historic and cultural particulars of different nations’ 
labor market structures. The insights from the cross-national comparisons are 
often striking. I need only recall the theoretical remarks in Cole (1979) about 
general and specific skills in the Japanese case. But the advantages of 
comparative work are not confined to those involving ILMs across nations. 
Even within nations as suggested above, greater number of detailed, case 
studies of concrete FILMs and OILMs within or across nations will also help 
realize the great promise of comparative work. 
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Abstract 


This review of sociologically relevant alcohol research addresses definitions 
of alcohol problems, describes patterns and trends in adult drinking practices 
and problems and correlates of alcoholism, and describes social policy re- 
sponses to alcohol. With implications for many measures of social well- 
‘being, alcohol research is relevant to almost all areas of traditional sociologi- 
cal interest, intersecting with religious and ethnic studies, with studies of 
social change and social movements, with theories of social control, with 
criminology and social deviance, with media research and analysis of social 
organizations, with study of age and gender roles, with medical sociology, 
and with sociology of the work place. Sociologically relevant alcohol research 
of the last few years, while rich in the above areas, is by no means exhausted 
and holds great potential to illuminate issues of general interest to sociologists 
as. well as to specialists in medical sociology or deviance. 


LOCATING THE TERRITORY 


A potentially important question in the sociology of science is why, despite substantial 
involvement by sociologists in alcohol research, alcohol studies tend to remain outside the 
mainstream of sociology, Alcohol studies are insulated from a wider audience because few 
manuscripts find their way to publication in the principal sociology journals. 

(Blum 1984) 


As the above quotation (from a sociologist writing in a nonmainstream 
sociological journal) suggests, a major challenge to a review of sociological 
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alcohol research is identifying it. The identification process has at least two 
components. The first is a logistical one, in locating relevant studies in the 
vast array of largely nonsociological journals in which they may be published. 
This becomes exceedingly difficult when one realizes that an appreciable 
proportion of the literature is published in special technical documents (e.g. 
WHO Technical Reports, The Surveyor from the Alcohol Research Group at 
Berkeley, Addiction Research Foundation working papers) that are not readi- 
ly accessible to the general reader. A second problem is substantive; what is 
meant by “sociological alcohol research”? While one might propose to in- 
clude any alcohol research carried out by sociologists, such a definition is 
unsatisfying and parochial; it assumes that sociology is what sociologists do, 
ignoring the research of others that contributes to an understanding of how 
alcohol affects the dynamics and structure of society. Such a narrow defini- 
tion is particularly inappropriate for alcohol research because alcohol is at 
once a psychoactive drug, an agent with different cultural meanings, a 
substance used to cope with problems with social and other roles and to 
facilitate social interaction, and a commodity regulated by governmental 
agencies. By its nature, alcohol falls into the bailiwick of many disciplines. 
Likewise, consequences of its use fall into medical, criminal,. behavioral and 
social realms. With these points established, then, alcohol research reviewed 
in this paper draws on the work of researchers from many disciplines. To keep 
the paper’s size manageable, we have for the most part limited our review to 
publications since 1980. We first address various definitions of alcohol 
problems, including different assessment instruments and diagnostic criteria; 
next we describe patterns and trends in adult drinking practices and problems, 
and correlates of alcoholism; and last, we discuss the social response to 
alcohol, as revealed in policy issues. 


REFLECTIONS FROM THE OLD MASTERS 


Within the last few years, the Journal of Drug. Issues and the British Journal 
of the Addictions have each published a series of reflections on alcohol 
research by sociologists with distinguished careers in the field (Roman 1984, 
Mulford 1984, Trice 1984, Bacon 1984, 1985, Keller 1985, Pittman 1987a,b, 
. Straus 1984). These interviews and essays allude to many of the continuing 
controversies in alcohol research, including whether the focus on problem 
drinking is disproportionate to the description of the much more common 
social behavior of nonproblem alcohol use; the interdisciplinary nature of 
alcohol research and the accompanying jurisdictional problems; the cyclical 
nature of problem drinking; alcohol problems vs alcoholism as the study 
phenomenon of interest; whether controlling availability of alcohol has an 
effect on the prevalence of alcohol-problems; whether complete abstinence 
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from alcohol is the only path to recovery from alcohol problems. A review of 
these reflections would acquaint the reader unfamiliar with alcohol research 
with some of the still unresolved issues and questions in alcohol research 
today. 


DEFINITION AND MEASUREMENT 


The failure of researchers concerned with the alcohol problem to agree on the 
selection of problems or on a definition of alcoholism complicates cross-study 
comparisons. To provide some unity to these varying definitions in alcohol 
research, a project is currently underway to subject 33 longitudinal data sets to 
meta-analysis to synthesize findings about the emergence and persistence of 
alcohol problems (Temple et al 1987). This project reanalyzes existing data, a 
relatively inexpensive research strategy with potentially large rewards (In- 
stitute of Medicine 1987). Such analyses, it is hoped, will reveal the extent to 
which findings replicate across studies, and when they do not, the extent to 
which discordant results can be explained by the different eras in which 
studies were carried out, or the differences between samples with respect to 
culture or age cohort. The collaboration of the original investigators of each 
study in the meta-analysis should enhance the likelihood of meaningful 
findings from this project. 


The Disease Concept 


No review of sociological approaches to alcohol can ignore the substantial and 
heated protests of some sociologists against the disease-concept of alcohol- 
ism. This issue has created a schism in sociological alcohol research only now 
beginning to be bridged. Arguments mustered against treating alcoholism as a 
disease include the following (Vaillant 1983): Alcoholism does not meet the 
traditional medical concept of a disease because of the voluntary ingestion of 
the causal agent. Second, there is no natural cutting point that divides drinkers 
into “cases” and “noncases,” since alcohol consumption- itself forms a con- 
tinuous distribution. Third, there is stigma associated with labelling a person 
as an alcoholic. Lastly, defining excessive alcohol intake as a disease gives 
doctors and other medical professionals a jurisdictional claim over drinking 
behavior and makes an unvalidated claim that medical professionals enjoy 
special skills in treating it. 

Sociologists protesting the medical model have to overcome a long history 
of a medical view of alcoholism, dating from Benjamin Rush’s treatise in the 
early nineteenth century and continuing today, both in the Alcoholics Anony- 
mous view that alcoholism is like an inborn and irreversible allergy and in 
the inclusion of alcohol disorders in the International Classification of Dis- 
eases, used throughout the world. Some sociologists argue that empirical 
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investigations have failed to confirm the postulates underlying the disease 
concept of alcoholism (Room 1983d, Poikolainen 1982, Watts 1981). Rather 
than a single, irreversible disease for which abstinence is the only remedy, 
alcoholism can be shown to be heterogeneous and cyclical. In addition, 
abstinence from alcohol is not always a necessary condition for recovery. A 
final point made is that alcoholism represents a very small part of societal 
alcohol problems. Even if medical treatment could successfully combat 
alcoholism, many of the costs of drinking as revealed in traffic and work 
accidents and in disinhibition (with its costs in fights, sexual.imprudence, and 
various illegal behaviors) would not be effectively addressed. 

The medical community currently agrees that alcoholism is heterogenous 
but does not find that this heterogeneity disqualifies alcohol abuse as a 
medical illness. Its heterogeneity is accepted on both phenomenologic and 
genetic evidence, and efforts are underway to subclassify alcoholics in order 
to improve the prediction of heritability (Cloninger 1987). Evidence for the 
progressive nature of alcoholism has been equivocal. In part, analysis has 
been stymied by a lack of assessment instruments capable of tracing the 
course of the disorder, but again the possibility of recovery is not evidence 
against the medical model. A large general population study, the 
Epidemiologic Catchment Area (ECA) study, applies the official criteria of 
the American Psychiatric Association to both treated and untreated cases. 
This study has confirmed the spontaneous recovery of many alcoholics with- 
out treatment, and the fact that a significant proportion of those who reco- 
vered now drink without problems. In fact, the ECA study has established that 
although only a minority of alcoholics ever come to treatment (a finding 
which suggests that treated alcoholics are unrepresentative of all alcoholics) 
(McEvoy et al 1989), most of those who had at some time met diagnostic 
criteria as alcoholics were symptom free for at least the year prior to in- 
terview. Another finding consonant with the antimedical argument is that 
alcohol sometimes causes problems for persons who do meet criteria for the 
disorder. Yet those who think of alcoholism as a disease find none of these 
results a telling argument against their view. 


Assessment Instruments 


There are many instruments for assessing alcohol problems and alcoholism. 
To make a diagnosis of alcoholism according to an official diagnostic 
nomenclature such as the Diagnostic Statistical Manual of the American 
Psychiatric: Association, interviews and questionnaires must ask questions 
that map onto the diagnostic criteria. No official criteria need be followed if 
the study focus is the occurrence of alcohol problems selected by the in- 
vestigators. For the follow-up study to the seminal survey oñ drinking prac- 
tices of Americans (Cahalan et al 1969), Cahalan & Room developed several 
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drinking problem scales (1974) that have been adopted by some other studies. 
However, a recent review of alcohol survey research methods (Auth & 
Warheit 1982) finds that survey instruments are often unique to each study. 
These scales include drinking patterns (heavy intake, binge drinking, symp- 
tomatic drinking), psychological involvement (dependence, loss of control), 
drinking consequences (health, financial, personal, legal, and occupational 
problems), and meanings associated with alcohol. 

A common difficulty seen in these problem assessment scales is the failure 
to specify the time frame clearly. For example, the Michigan Alcoholism 
Screening Test (MAST) mixes lifetime with current experiences, and the time 
span identified as “current” is undefined (Selzer 1971). Common time frames 
include within 1 year, within 6 months, or within the last month. The sporadic 
nature of alcohol problems makes it critical to choose a time frame long 
enough to identify those with serious problems. 

Although there is more standardization with respect to quantity of alcohol 
consumed, variability exists in these measures as well. The time frame may 
be the- last 30 days, the last 2 weeks, or the past year. Because drinking 
practices more than one year ago are rarely ascertained, correlates of absten- 
tion are confusing (Goldman & Najman 1984). Current abstainers who once 
drank differ markedly from lifelong abstainers not only in demographic 
characteristics but also in current health (many quit drinking because of 
illness). Current abstainers include many persons with past alcohol problems, 
who chose abstinence to control them (Wilsnack et al 1984, Klassen & 
Wilsnack 1986, Warheit et al 1988b). 

Many of these methodological problems have been overcome in recent 
years as standard psychiatric diagnostic criteria covering the whole life span, 
as well as the recent past, have been operationalized in several instruments 
(see HELPER, SADS, DIS, SCID, CIDI, CIDI-SAM). The newest of these, 
the CIDI-SAM, applies criteria of the newest revision of the American 
Psychiatric Association (DSM-III-R 1987) as well as ICD-10 (Cottler et al 
~ 1989). These instruments do not obtain information about usual drinking 
practices, since diagnosis is based on problems only. This difference in 
purpose prevents linking studies using these instruments to community-based 
studies of drinking practices. Although high quantity/frequency and problem 
measures tend to select the same persons, the overlap is not so complete as to 
resolve the comparability issue (Clark 1966). 


DRINKING PRACTICES AND PROBLEMS 
Consumption 


International comparisons of drinking habits are possible based on consump- 
tion data as well as on interview. The United States ranks eighteenth in the 
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world in terms of alcohol consumed per capita (Mendelson & Mello 1985). 
Per capita ethanol consumption increased between 1950 and 1982, with a 
particularly sharp increase (22%) between 1960 and 1975. Since 1982, 
however, decreases in overall consumption have been recorded (G. D. Wil- 
liams et al 1986). 


DEMOGRAPHIC CORRELATES Consumption figures, unfortunately, tell us 
nothing about how the use of alcohol is distributed in the society. Societies in 
which every adult drinks a little can have per capita consumption rates 
identical with those of societies with many abstainers and many very heavy 
drinkers. Sixty-four percent of Americans drink alcohol, according to recent 
estimates (National Center for Health Statistics 1986). Compared to women, 
more men drink and they drink more heavily. The proportion of abstainers 
increases with age. While it is generally believed that older drinkers also drink 
less than the young, a recent study (Warheit et al 1988b) suggests that older 
drinkers drink as much as younger drinkers. For many ethnic-subgroups in 
America, the highest rate of heavy drinking has been found among young 
males, although recent studies suggest considerable variation among Asian- 
Americans, Native Americans, blacks, and hispanics (Kitano 1988, Caetano 
1984b, Herd 1989, Weibel-Orlando 1989). One important variable appears to 
be the degree of acculturation, with rates higher among the more acculturated 
(Institute of Medicine 1987, Caetano 1987). 


Problems and Disorders 


In the last 10 years, surveys that make alcohol diagnoses along with diagnoses 
of other psychiatric disorders have made it possible to assess alcohol’s relative 
public health importance. Assessing disorders over the lifetime, these studies 
show alcohol abuse and dependence to be the most common of all mental 
disorders (Robins et al 1984). Recent studies also show that alcoholism is 
very unevenly distributed through the population. 


AGE Age is such a powerful correlate of drinking problems and alcoholism 
that it has been termed an “organizing principle” for both men and women 
(Fillmore 1987a,b). Among men, follow-up: studies have revealed young 
adulthood (twenties) as the time of appearance of the first alcohol problem, 
middle age (thirties) as the appearance of alcoholism, and old age (sixties) as 
the abating of alcohol problems (Fillmore 1987a). Women have both later 
onset and shorter duration of drinking problems. Exploration of gender 
differences in the waxing and waning of alcohol problems is an eres and 
potentially fertile area of research for sociologists. 


ETHNICITY . Studies onnan drinking practices and problems of blacks 
and whites generally reveal that, compared to whites, blacks have much 
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higher abstention rates, drink less often, report fewer episodes of drunken- 
ness, and start drinking later (Warheit et al 1988a, Herd 1989). Heavy 
drinking peaks later for black men than for white (18-29 for white males, but 
30-39 for black males) (Caetano 1984a, Caetano & Herd 1984). Similar 
differences are found when alcoholism is the phenomenon of interest (Robins 
1989, Helzer & Burnam 1988). In the youngest age groups, whites have a 
lifetime prevalence twice as high as blacks, but with increasing age, black 
rates exceed those for whites. This ethnic difference in age patterns holds for 
both men and women. Despite their delay in beginning alcohol consumption, 
heavy drinking blacks have more health problems, particularly more cirrhosis 
mortality, and more social consequences of alcohol use than do whites (Herd 
1985, 1987; Warheit et al 1988a;-Grant et al 1986). 

Other ethnic groups have been little studied by social scientists, although 
there are interesting opportunities to relate a biological phenomenon, the 
flushing response of Asians and American Indians, to the very variable rates 
among Asian and Indian populations, varying from the low rate of alcoholism 
in the Chinese to extraordinarily high rates in some American Indian tribes 
and among Koreans. 


REGIONAL VARIATION There is great regional variation in American drink- 
ing practices and problems (Room 1983b). Per capita consumption trends 
establish the “wet” regions of the United States as New England, the mid- 
Atlantic, East North Central, and Pacific regions; the driest region is the 
Southeast. “Dry” regions have been reported to have higher per drinker 
consumption rates than “wet” areas. However, Southeastern drinkers are not 
more likely to become alcoholic than are drinkers elsewhere (Helzer & 
Burnam 1988). 


RELIGION Once a:Sstaple variable in sociological research on drinking, 
religion is now a rare topic. Its omission from some: large recent surveys 
reflects the US government’s decision not to permit questions about religious 
preference in surveys conducted under government contract or collaborative 
agreement, although in studies of adolescents, church attendance (with no 
specification of denomination) is allowed as a surrogate for conventional 
attitudes. Nonetheless, the low rate of alcohol problems and alcoholism 
among Jews, stable over the whole history of American surveys on drinking 
problems, continues to be of interest because Jews are rarely- abstainers 
(Glassner & Berg 1980, 1984). Cited as forces fostering their moderation are: 
the association of alcohol abuse with being a non-Jew, integration of mod- 
erate drinking into religious and family rituals, restriction of primary social 
networks to moderate drinkers, and techniques to avoid excessive drinking. It 
is not known whether these explanations are unique to. Jewish moderation or 
equally true of moderate drinkers generally. 
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SOCIAL CLASS Like religion, social class is now an infrequent topic in 
alcohol research. Here the findings differ according to whether the research 
topic is drinking patterns or problem drinking. Both abstention and drinking 
problems are generally reported to be high in the lowest social class, while 
regular and even heavy drinking is common in the higher socioeconomic 
classes. The ability of the higher social classes to drink considerably without 
adverse consequences has not been well explained. Possible explanations 
include genetic selection and problem drinkers’ rapid exodus from the middle 
class through downward occupational mobility. 


Changes in Women’s Drinking Patterns and Problems 


A series of surveys (Clark & Midanik 1982) indicate that overall American 
drinking patterns have remained stable since 1971. Of course, this need not 
mean that there have not been temporal changes for various subpopulations, 
changes which balance out. 


CHANGES IN WOMEN’S DRINKING Recently, there has been special interest 
in tracking changes in women’s drinking practices (Fillmore 1987b, S. C. 
Wilsnack et al 1986, 1988, S. C. Wilsnack 1988, Wilsnack & Cheloha 1987). 
In 1981, a modest increase in heavy drinking among women aged 35—49 was 
observed. In a 20-year follow-up, Fillmore (1987b) tempered her findings that 
there was a higher proportion of heavy frequent drinkers in younger, as 
compared to older, cohorts of women; other studies did not find those 
differences. 


CONSEQUENCES FOR REPRODUCTIVE HEALTH As women’s drinking has 
increased, concern has developed about the consequences for women’s repro- 
ductive experiences and sexual dysfunction (Wilsnack et al 1984, Klassen & 
Wilsnack 1986, Wilsnack et al 1987). Cross-sectional data (S. C. Wilsnack et 
al 1984) suggested that women who drink extremely heavily have an excess of 
menstrual dysfunction, miscarriages, gynecological surgery, infertility, pre- 
mature babies, and babies with birth defects. Temporary abstainers, those 
women who had drunk alcohol in the last year but not in the last 30 days, were 
identified as a group with particularly high rates of menstrual dysfunction, 
hysterectomies, and other gynecological problems. Because birth defects 
were not rigorously defined nor the smoking status of the mother controlled 
(an important omission because drinking and smoking are highly correlated, 
and mothers’ smoking is definitively related to low birth weight), further 
research was needed to determine the extent to which serious birth defects are 
associated with mother’s heavy drinking. Recently available longitudinal data 
support these cross-sectional findings (Wilsnack et al 1988). The longitudinal 
design should provide a better understanding of the causal order, since 
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adverse life events such as the gynecological and obstetrical difficulties may 
lead to drinking as well as vice versa. 


CHANGING WOMEN’S ROLES AND DRINKING A second -area of research 
attempts to assess how changes in women’s roles relate to their alcohol 
consumption. Findings to date are equivocal, but intriguing. First is the report 
by Fillmore (1984) of a sharp increase in the ratio of heavy drinkers among 
young employed (vs unemployed) women between 1964 and 1979. Wilsnack 
& Cheloha (1987) found that loss of employment, as well as loss of marital 
and child-rearing roles, put women at risk for problem drinking. Reconciling 
results of these two studies seems possible if women went to work when their 
marriages broke up or their children left home, and thus became at risk of job 
loss as well. 


EFFECTS ON SEX RATIOS [In a thorough report, Ferrence (1980) computed 
sex ratios for daily alcohol use, heavy alcohol use, and problem drinking 
reported in numerous adult and adolescent surveys, along with sex ratios of 
mortality from various diseases attributed to alcoholism, and concluded that 
there was little evidence to support convergence of rates in men and women. 
However, Ferrence used overall rates, not rates for individual age groups. 
Other recent studies that examined rates by age cohort suggest that rates of 
alcoholism between the sexes are converging in the youngest age cohorts 
(Helzer & Burnam 1988, Reich et al 1988). 


Drinking and Other Pathology 


ILLICIT DRUG USE Alcohol use is embedded in a variety of forms of deviant 
behavior. This has been well-established by studies of the initiation and 
progression of adolescent substances use (Kandel & Logan 1984, Yamaguchi 
& Kandel 1984, Raveis & Kandel 1987) showing that alcohol use functions as 
a gateway to illicit substance use and is itself predicted by a lack of identifica- 
tion with school achievement and conventional attachment to parents’ values. 
In addition, Robins & McEvoy (1989) have shown that conduct disorder in 
childhood predicts unusually early use of alcohol and the development of 
alcohol problems. 

Concomitant use of alcohol and other drugs has been less well-studied in 
adults, no doubt because drug use is rare in adults over 40, but the association 
between alcohol problems and drug abuse has been well established in both 
clinical and population samples (Taylor & St. Pierre 1986, Helzer & Przybec 
1988). 


ANTISOCIAL BEHAVIOR AND CRIME There is also a strong association 
between antisocial personality disorder and alcoholism (Lewis et al 1985, 
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Hesselbrock et al 1985, Helzer & Przybeck 1988). Antisocial alcoholics have 
earlier onsets of alcohol use and of problem drinking and often: a worse 
prognosis. In addition, the combination of alcoholism and antisocial personal- 
ity appears to have a higher heritability than does simple alcoholism. 

The criminological literature often cites the role of alcohol as a facilitator of 
criminal acts, particularly crimes against persons. Lenke (1982) analyzed 
ecological data from several Scandinavian countries on alcohol consumption 
and violent crimes over long time periods and found a positive correlation 
between alcohol consumption and violent crime rates. Ecological studies, of 
course, do not tell us whether these two behaviors occurred in the same 
persons. Welte & Miller (1987) compared self-reported drug and alcohol use 
(both chronic use and that just prior to the crime) among subjects incarcerated 
for violent and property offenses. Although those committing violent crimes 
were more likely than property offenders to have been drinking, a majority of 
both types had been drinking just prior to the crime. Through discriminant 
analysis, it was found that the association between violent crime and alcohol 
was particularly strong for women and those with higher education (high 
school or beyond). 

While these studies indicate that among those convicted, alcohol use is 
common just prior to committing a crime and particularly just prior to crimes 
against persons (Room 1983a), evidence is still lacking as to whether those 
convicted were more often drinking at the time of the crime than at other times 
during the day, and if so whether the drinking contributed more to their 
committing the crime or to their getting caught for it (Collins 1986). These 
issues are critical because the overall level of heavy drinking and alcohol 
problems among criminals is high compared to levels for the general public 
(Greenberg 1981). 

In a 33-year follow-up of men who as children had been enrolled in a 
program to prevent delinquency, McCord (1981) found that over 50% of 
subjects who became criminals were alcoholic as well. She also found that 
alcoholic criminals came from family backgrounds more similar to those of 
other criminals than to those of other alcoholics. She concluded that the 
etiology of alcoholism among criminal and noncriminal alcoholics was prob- 
ably different; recent genetic studies support her conclusion (Cloninger 1987). 


DEPRESSION Because alcoholic patients often have symptoms of depres- 
sion, an association between these two disorders has also been sought in the 
general population. However, analysis of data from a large community survey 
of psychiatric disorder found no stronger a relationship between alcoholism 
and depression than between alcoholism and other psychiatric disorders (Hel- 
zer & Przybeck 1988). Presumably their association in clinical samples was 
explained by the fact that suicidal ideation in alcoholics is an important reason 
for their referral to treatment. 
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SOCIAL RESPONSE TO ALCOHOL PROBLEMS 


Social response to alcohol problems may be thought of in two areas: policies 

-aimed at the control of.alcohol, which we will call public control strategies, 
-and policies aimed at deviant. drinkers, which we. will call personal control 
policies. Public control policy is defined by Room (1984) as “any intervention 
by the state in controlling the production, distribution and sale of alcohol as a 
means of preventing alcohol problems:” Examples are taxes, licensing of 
vendors, and limiting advertising. Examples of personal control policies are 
laws against drunk driving, laws setting a minimum drinking age, and provi- 
sion of treatment. 

Recent studies of public control policies have covered the imposition of 
higher taxes on alcohol products; restrictions on vendors, purchasers, and/or 
sales outlets for alcohol; and restrictions on advertising alcohol products 
(Ross 1984, Smart & Goodstadt 1977, Single et al 1981, Single & Eakins 
1981, Moskalewicz 1981). There is controversy over whether laws aimed at 
reducing overall consumption of alcohol will reduce alcoho! problems. Op- 
ponents argue that the demand of addicted drinkers (presumably the bulk of 
those with alcohol problems) is inelastic. Since addicts will not reduce their 
drinking, the nonproblem drinker will bear the burden of higher prices for 
alcohol or restricted outlets or hours for purchasing alcohol, and all to no avail 
so far as reducing the number of addicts. Critics also note the costs of such 
programs in terms of added administrative needs and job losses in the alcohol 
industry itself and in related industries such as tourism and restaurants. The 
alternative is to try to reduce the numbers of alcoholics directly by increasing 
treatment facilities, by more prevention, or through more severe and more 
certain punishment for crimes committed when drinking. On both sides of the 
argument are questions about cost-benefit ratios and tem cement of human 
rights (Makela & Room 1985). 


Evidence for the Riscivenexs of Price and Vendor 
Restrictions 


Three recent summaries of the evidence. on the effectiveness of alcohol 
control policies provide weak (Ravn 1987), cautious (Rush & Gliksman 1986) 
and strong (Room 1984) endorsements of the strategy of controlling consump- 
tion. All three agree that control policies might form one component in an 
overall program to reduce alcohol problems; what they disagree about is how 
prominent a component such a strategy should be. 

Sometimes lost in the often heated debate is the fact that the control of 
consumption argument is based on two assumptions: first, that governmental 
policies can affect the general level of alcohol consumption, and second, that 
a reduction in overall consumption will lead to reduced rates of alcohol 
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problems. These assumptions are independent, and both must be demon- 
strated to be correct to support the effectiveness of control policies. 

Alcohol control policies can reasonably be expected to influence alcohol 
consumption, since price and alcohol consumption are inversely related (Levy 
& Sheflin 1983, Rush & Gliksman 1986). However, few studies have ad- 
dressed the second component of the debate. Those that have are generally 
reports of natural experiments, evaluated ecologically, in which aggregate 
consumption and aggregate problem measures are assessed pre and post some 
change in policy. Perhaps no better example of a natural experiment in control 
exists than Prohibition in the United States. Current historical reanalyses 
suggest that alcohol problems, as measured by rates of cirrhosis, registered a 
sharp decline during Prohibition, a decline that continued for some years after 
Repeal, reflecting the delay between onset of heavy drinking and onset of 
physical damage (Room 1984). Other studies of natural experiments such as 
rationing of alcohol in Greenland, the liquor store strikes in Scandinavia, and 
changes in alcohol availability in Poland and in California provide evidence 
that policy changes do indeed affect both the supply of alcohol and the rate of 
alcohol problems (Schechter 1986, Makela 1980, Longnecker et al 1981, 
Moskalewicz 1981, Rabow & Watts 1982). However, several other studies 
(Rush et al 1986, Fitzgerald & Mulford 1984, Smart 1987) find that changes 
in consumption are not necessarily paralleled by changes in problems. A 
study comparing two Canadian provinces over 10 years observed approx- 
imately equal decreases in problems despite a four-fold difference in the drop 
in consumption. Further, a third province experienced a decline in some 
problem measures while alcohol consumption increased (Smart 1987). 

Even when effects on alcohol problems are clearly positive, alcohol control 
policies may have adverse side effects. The study of rationing of alcohol in 
Greenland (Schechter 1986), which found a sharp decrease in alcohol con- 
sumption, also found associated with it an increase in narcotics offenses, the 
growth of a black market, and an increase in thefts and burglaries. 

The ecological design of all these studies precludes directly addressing the 
question of whether reduction in the overall level of consumption will alter the 
drinking patterns of problem drinkers. However, a study that used interviews 
with “regular” drinkers rather than ecological evidence to assess the effects of 
an increase in excise taxes on alcohol (Kendell et al 1983) did find that heavy 
and dependent drinkers reduced their alcohol intake at least as much as did 
light or moderate drinkers, and that the heavy drinkers experienced a reduc- 
tion in their alcohol problems. Unfortunately, no assessment was made over a 
comparable control period elsewhere to allow separating the effect of the 
increase in excise taxes from the natural history of a decrease in intake and 
alcohol problems with age. Evidence, then, that policies such as taxing and 
rationing that successfully reduce overall consumption of alcohol also lead to 
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reducing alcohol ‘problems is still equivocal. We need better evidence that 
reductions occur, that they endure once previously excessive drinkers get used 
to the change in price or availability, and that the alcohol intake is not 
replaced by perhaps more dangerous substances. 

The theory that gave the consumption control devotees their start, namely 
the Ledermann curve, is no longer widely supported (Pittman 1980, 1983, 
Ravn 1987b). Briefly, Ledermann’s assumption that alcohol consumption is 
distributed lognormally implies that lowering average consumption will shift 
the curve to the. left, thereby reducing the number of heavy drinkers. It is 
generally agreed that Ledermann’s data bases were inadequate, and that the 
lognormal distribution does not describe the distribution of alcohol consump- 
tion well. However, abandoning the Ledermann theory does not mean that 
controlling alcohol consumption does not affect the rate of alcohol problems. 
As Rush & Gliksman (1986) note, the evidence, while not incontrovertible, is 
nonetheless “scientifically reasonable” in favor of the control argument. 


Alcohol in the Media 


The literature on alcohol and the media can be divided into (a) content 
analyses of advertisements in magazines and on television and of the portrayal 
alcohol use in television dramas and the cinema, and (b) investigation of the 
effects of alcohol advertising on consumption. 


MEDIA TREATMENT OF DRINKING 


In advertisements Content analysis of alcohol advertising in magazines 
reveals that it dwells on the virtues of a given product. Human models are 
infrequently depicted, and when they are, they are usually male. The over- 
whelming majority of magazine advertisements are for distilled spirits, prob- 
ably because advertisements for distilled spirits are banned from radio and 
television. A higher concentration of alcohol advertisements are found in 
magazines aimed at the well-educated, men, and blacks than in those directed 
at women, young adults, or teens (Strickland & Finn 1984, Strickland et al 
1982) 

Unlike magazine advertising, a majority of television advertisements con- 
tained human models (Finn & Strickland 1982). Advertisements for beer 
predominate over advertisements for wine and are again aimed heavily at 
males. No studies have yet successfully tested the advertisers’ claim that 
advertisements do not increase total alcoho] consumption of alcohol abuse but 
only account for shifts from one brand to another. 


In dramatic productions The Alcohol Research Group at Berkeley un- 
dertook a systematic. analysis of the portrayal of alcohol in American films as 
a means of studying drinking norms and changes in attitudes towards drinking 
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(Room 1983c). Films of the 1930s and 1940s (i.e. post-Prohibition) were 
considerably “wetter” than earlier ones, reflecting the positive post- 
Prohibition-era attitude toward alcohol, as well as a less moralistic general 
climate. Contrasting portrayal. of alcoholism and alcohol problems in two 
cinematic eras, 1921—1939 vs 1961-1979, Herd (1986) notes a’shift to a more 
negative image of alcoholism in the more recent films, with attention given to 
female alcoholics, sexual dysfunction and other negative consequences of 
alcohol use. She ties this change to a shift in the social view of alcoholism, 
from one rooted in the substance (“demon rum” in the twenties) to one that 
places its origin in particularly vulnerable persons. 

Researchers have noted that in television dramas, alcohol use is common, 
exceeding the use of all nonalcoholic beverages combined (Defoe et al 1983, 
Macdonald & Estep 1985), and it is generally portrayed as being without 
negative consequences (in sharp contrast to depictions of use of illicit drugs). 
From 1976 to 1985, the number of alcohol-related events (characters both 
preparing to use and drinking) increased in television series (Lowery 1980, 
Breed et al 1984, DeFoe et al 1983, Macdonald & Estep 1985). 


EFFECTS OF THE MEDIA ON CONSUMPTION 


Surveys on advertising effects Strickland (1983) related teenagers’ exposure: 
to televised alcohol advertising to their drinking behavior and drinking prob- 
lems. Drinking behavior was assessed by self reports of usual alcohol con- 
sumption. Problem drinking was rated based on frequency of intoxication, 
symptomatic consumption, psychological and other negative consequences of 
alcohol use, as well as whether drinking was reported to be a problem in the 
last year. Information on peer drinking and parental drinking norms, and 
differential peer associations was also obtained and used in the multivariate 
analysis. Exposure to advertising was inferred from the amount of television 
viewing in the past month. Advertising was found to have a negligible impact 
on consumption levels or alcohol problems, while interpersonal factors such 
as peer associations appeared to be more influential. However, the ambiguous 
time frames of the measures of alcohol use and imprecise assessment of 
exposure to advertisements make these. conclusions uncertain. 

In a study of the effect of alcohol advertising on excessive. and harmful 
drinking, Atkin et al (1983) surveyed subjects sampled from various school 
settings (high school, college, trade school) or solicited at shopping malls, 
social clubs, and through newspaper advertising. Subjects were asked about 
usual heavy drinking behaviors and about hazardous drinking practices (such 
as driving after drinking). An advertising exposure index was created based 
on self-reports of.exposure to certain magazines or television programs and 
the amount of attention given to alcohol-advertisements, including common. 
magazine advertisements that were shown to each subject..A moderate posi- 
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tive association was found between both drinking measures and the advertis- 
ing exposure index, and this remained when factors such as family drinking 
patterns were controlled. However, neither advertising exposure nor most of 
the problem drinking indicators referred to a specific period of time, leaving 
as a possible alternative interpretation of the assoċiation that high alcohol 
consumers watch more television or pay more attention to alcohol advertising. 


Experiments in media exposure Kobn & Smart (1984) subjected volunteer 
male undergraduates to 0, 4, or 9 beer commercials during a four-hour 
videotaped soccer game. Subjects were told that the purpose of the study was 
to evaluate the appeal of televised indoor soccer. During the experiment, 
subjects had immediate or delayed access to beer and soft drinks. Amounts 
drunk were measured. The authors found that male subjects increased their 
beer consumption in response to beer advertising in the first half hour of the 
experiment, but not thereafter, regardless of the intensity of the beer advertis- 
ing. Because no later effects were observed, the authors cautioned against 
generalizing from studies in which observation periods are brief. 

A similar experiment was carried out with female subjects; advertisements 
shown.and refreshments offered were changed to wine instead of beer (Kohn 
& Smart 1987). In addition, the authors measured each subject’s suspicious- 
ness about the experiment. They found that female students increased their 
wine consumption only after exposure to many wine commercials. However, 
women who were more suspicious about the experiment tended to drink 
nothing or a lot, independent of exposure to alcohol advertisements. When 
data were reanalyzed using only nonsuspicious subjects, advertising did not 
have a significant effect on amount of alcohol consumed. 

Sobell et al (1986) assessed the drinking behavior of male college students 
who were exposed to two different versions of a one-hour television program, 
one which included alcohol scenes and references, the other with all reference 
to alcohol edited out. A second experimental condition varied whether the 
first advertisement shown was for beer, a nonalcoholic beverage, or food. 
After viewing the program, subjects were asked to taste several types of light 
beer (actually, all the same. brand); the amount of beer consumed was the 
outcome variable. The authors reported that neither exposure to alcohol 
scenes nor to alcohol advertisement led to an increase in the amount of beer 
consumed. 

In none of these studies has strong evidence been found for a link between 
exposure to alcohol in the media and either alcohol consumption or alcohol 
problems. Survey results appear to have been vitiated by the ambiguous 
and/or nonconcordant time frames between drinking behaviors and the 
advertising exposure. The three experimental studies, all in one-sex samples 
and without any investigation of delayed effects, may not generalize to 


178 BUCHOLZ & ROBINS 


mixed-sex behavior in real-life settings or to effects over longer intervals. 
Overall, the available evidence that there is no connection between alcohol 
advertising and alcohol consumption is weak. 


Drinking-driving Laws 


A recent review of drinking-driving laws worldwide (Ross 1984) found that 
the effect of legislative changes appears to be short-lived. In the wake of 
stringent laws against drunk driving, the consistent findings (Hingson et al 
1987) seem to be that drunk driving rates decrease initially, but then start to 
creep up again to pre-law levels, sometimes after only a few months. In- 
creased media attention to the issue of drunk driving in the immediate pre- and 
post-law periods may account for the decrease as much as do the legislative 
changes themselves. The consistent finding that effects of stiffer drunk driv- 
ing legislation are transitory may lead to penalties more extreme than yet 
instigated or to giving more attention to strategies that reduce the drunk 
driver’s likelihood of serious or fatal injuries. Among these strategies are 
requiring airbags in automobiles, removing dangerous highway obstacles, and 
instituting taxi service to and from drinking establishments. 


Minimum Drinking Ages 


During the 1970s, many states lowered their legal drinking age from 21 to 18, 
providing research opportunities for assessing the effects of this change on 
teenage drinking patterns and on teenage auto accidents and traffic fatalities. 
This trend was then reversed in the 1980s. In a 1977 review, Smart & 
Goodstadt concluded that lowering the drinking age resulted in higher 
proportions of young people drinking and possibly in an increase in alco- 
hol-related automobile accidents and other alcohol problems as well. They re- 
commended a moratorium on lowering the drinking age. Several states 
subsequently raised the minimum age. 

Three years after the drinking age was raised in Massachusetts from age 18 
to 21, Smith and colleagues (1984) compared results with the same period in 
New York, where the drinking age had remained at 18 over the same time 
period. They found neither a relative decline in drinking among Massachu- 
setts teenagers nor a substantial reduction in single vehicle nighttime fatal 
crashes, although they did observe a significant decrease in nonfatal crashes. 
In contrast, Williams & Lillis (1986) reported a significant decrease in both 
moderate and heavy drinking among teenagers in New York State in the 
period following a rise in the minimum drinking age. However, the study was 
conducted only one year after the rise, so it is unknown whether the changes 
were stable. Clearly, the effects of changes in the legal drinking age have not 
been settled. As Ross (1984) notes, data on-traffic accidents and fatalities, 
routinely collected for many years, offer a valuable opportunity for research- 
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ers to compare rates of drunk driving and nonfatal and fatal traffic accidents 
over extended pre- and post-law periods. 


Treatment 


As might be expected from the fact that treatment of alcoholism is largely in 
the hands of clinicians, sociological research plays a relatively small role in 
this area. We review several kinds of sociological contributions to the treat- 
ment literature. o 

First are studies of the provider-client relationship. These studies have dealt 
particularly with bias against female patients (Leigh & Gerrish 1986, King & 
Hahn 1981, Rice & Shaw 1984, Vannicelli & Harrison 1984). A technique 
frequently used is to compare clinicians’ responses to identical descriptions of 
cases, with only the gender changed. How well such studies reflect the 
clinicians’ actual gender bias is unknown. 

Another area of interest to sociologists has been the controversy over 
whether alcoholics who fail to achieve abstinence should be offered the 
treatment goal: of becoming a moderate drinker. Although the bulk of the 
research has been conducted by psychologists and psychiatrists, this issue is 
of interest to sociologists because it is relevant to the contrast between the 
intractability of alcoholism as reported in much treatment research and the 
high rate of recovery from alcohol problems found in social surveys. Roizen 
(1987) has recounted the history of the controversy, focusing on what he calls 
the three major episodes: the 1962 publication of Davies’ research reporting 
seven alcoholics’ successful return to controlled drinking; the 1976 Rand 
Report’s finding of normal drinkers among American alcoholics who had 
been treated at federally supported institutions; and the 1982 dispute over the 
findings of the Sobells regarding treatment outcome in patients specifically 
taught how to drink moderately. Roizen argues that the lack of an effective 
treatment for alcoholism is to blame for the controversy in the first place. The 
controlled drinking proponents, in suggesting that there might be another 
route to recovery besides abstinence, earned the wrath of organizations 
devoted both to abstinence and to the concept of a sharp division between 
normal and pathological drinking. Lengthier follow-ups recently suggested 
that the course of treated alcoholism is indeed fluctuating, and few “social 
drinkers” maintain that status for long; they progress either to abstinence or 
relapse (Helzer et al 1985). 

A more global approach looks at the meaning of treatment for alcoholism in 
the total social scene. Gusfield (1982) argues that in the welfare state, 
deviants (chronic alcoholics) are thought to be both reformable and 
rehabilitatable, and treatment resources to assist them in making these 
changes are provided. He claims that the “troubled persons industry,” which 
includes alcohol treatment personnel, is responsible for “the invention of 
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alcoholism” and has a stake in describing and maintaining this deviant group 
of alcoholics, just as they have a vested interest in maintaining the mentally 
ul, the ethnically disadvantaged, and the physically handicapped, whom they 
also serve. 

Baumohl & Room (1987) have carefully reviewed the heretofore neglected 
international history of alcohol treatment from 1840 to 1940. They conclude 
that self-help remedies rest more on faith than science, that class bias reserved 
treatment for the middle class alcoholic, and that medicalizing alcohol treat- 
ment may have advanced the medical profession more than it improved 
treatment. 

Research into what one sociologist has called the “social ecology of the 
American alcoholism treatment system,” that is, who gets treated, how they 
get there, and which subjects are overlooked (Room 1980), has been ad- 
dressed in a WHO-sponsored project surveying community subjects in Mex- 
ico, Zambia, and Scotland about their drinking practices and preferred in- 
terventions for problem drinkers. In summarizing the findings from this 
study, Rootman & Moser (1984) stress differences in community responses; 
for example, Zambians favor punishment over treatment, Scots favor treat- 
ment over punishment, and Mexicans are partial to both. Intervention by 
friends and neighbors was advocated in both Zambia and Mexico, while 
official agencies were preferred in Scotland. The study suggested that given 
the markedly different response patterns, solutions may not be transferable 
from one country to another. 


CONCLUSIONS 


Sociologists who study alcohol now generally agree that alcohol problems as 
well as drinking patterns formulated in the traditional quantity-frequency 
terms are worthy of study. They have even had a voice in the revision of the 
standard diagnostic nomenclature of the American Psychiatric Association 
and the International Classification of Diseases. While this may put them at 
risk of being coopted by psychiatry, they seem to have made at least as much 
contribution to psychiatric thinking as they have been influenced by it. 
Sociologists have also increasingly participated in cross-disciplinary sur- 
veys in which they develop and use standardized interview protocols which 
translate official psychiatric nomenclature into ordinary language. It may be 
too early to be confident that their collaboration with clinicians in developing 
criteria for and measuring the prevalence of alcohol disorders in the general 
community will mend the schism between the sociological and medical 
approaches to alcohol problems, but it seems to have helped. Through such 
collaboration, the clinicians’ information gathering techniques have been 
demystified, and through epidemiologic and family studies, clinicians have 
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been introduced to problems of assessing alcohol problems in nonpatients. 
Further, they have learned the value of the sociologist’s quantitative tech- 
niques for estimating prevalences and evaluating risk factors. Not least, the 
results of these collaborative studies have shown clinicians that alcoholism 
when measured with their own trusted tools (not by an arbitrary scale devised 
by a social scientist) is a heterogeneous entity with a fluctuating course— 
messages that until recently fell on deaf clinical ears. At least now there is 
agreement about what the data are, if not necessarily about their interpreta- 
tion. 

Sociologists have also learned something in the course of these col- 
laborative studies. They have learned about the pervasiveness of alcohol 
disorders in America as compared with most other countries, and that the rate 
of alcohol problems has markedly increased over the last 50 years. They have 
learned that alcohol problems have implications for all kinds of measures of 
social well-being, including crime rates, marital stability, job stability, and 
earnings. Perhaps these results will help bring alcohol research into the 
sociological mainstream and falsify the quotation with which we began. 
Alcohol research surely has that potential. It can offer opportunities to carry 
out and evaluate the experiments in social change that are the crucial test of 
social theory. It is relevant to almost all areas of traditional sociological 
interest, as the diverse set of topics we have attempted to cover illustate, and it 
intersects with religious and ethnic studies, with studies of social change and 
social movements, with theories of social control, with criminology and 
social deviance, with media research and analysis of social organizations, 
with study of age and gender roles, with medical Sociology, and with sociolo- 
gy of the work place. 

Sociologically relevant alcohol research of the last few years has been rich 
in the areas investigated, but it has as yet offered few answers. Yet the 
questions opened up are clearly worthy of further exploration and are suscep- 
tible to being answered with our current tools. The new diagnostic interviews 
mentioned above, the CIDI and CIDI-SAM, are being sponsored by the 
World Health Organization and will be available in many languages, making 
possible identical surveys in many cultures. These instruments can be sup- 
plemented with additional questions that test hypotheses about why there are 
enormous differences in rates of alcohol problems in different cultures and 
sub cultures and in different eras. 

In America, it should be possible to test the observation that alcohol 
problems are increasing disproportionately in the young and in women, and to 
learn why. Of special interest are findings that have changed over time. 
Drinking in high abstinence areas no longer appears to be associated with 
elevated rates of problems. Are the media responsible, bringing to drinkers 
everywhere a homogenized set of attitudes toward and meanings of alcohol. 
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We know that changes in drinking laws have an immediate impact, followed 
by a tendency to return to previous levels. What controls the duration of the 
initial effect? We know that few alcoholics seek treatment, but we understand 
poorly what the barriers are to treatment. Nor do we have evidence that the 
low rate of treatment-seeking makes any difference in the prevalence of 
alcohol problems. 

Drinking is a social behavior with a large number of sociologically relevant 
causes and consequences. Research in this area remains a promising and by 
no means yet exhausted field, which can illuminate issues of general interest 
to sociologists as well as to the specialist in medical sociology or deviance. 
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Abstract 


Traditionally, the study of the life course has been divided into research on 
different age groups, different life phases, and different life domains such as 
the family cycle, fertility history, occupational careers and employment, the 
dynamics of income and consumption, migration, and normative patterns of 
aging. The emerging field of theory and research on the impact of the state on 
the structuring of the life course highlights overarching and integrative mech- 
anisms for institutionalizing the life course. Therefore, the field constitutes a 
new’ analytical perspective rather than a specializéd area of research. This 
review attempts to make the theoretical perspective explicit and to collect the 
various contributions from very scattered research reports. The major empha- 
sis is macrosociological and theoretical. Examples are drawn from research 
on. childhood, education, military service and wars, public employment, 
retirement, and old age. Particular attention is paid to the historical aspects of 
increasing state regulation. The review is based on US and Western European 
literature. S 3 


INTRODUCTION 


The last 15 years have seen a growing interest and ant explosive increase in 
theoretical and empirical studies of the social organization of aging and the 
life course (Riley et al 1972, Riley 1979, Riley et al 1984, Riley & Huber 
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1988, Elder 1975, 1985a, 1986, 1987, Binstock & Shanas 1976, Rossi 1985, 
Sgrensen et al 1986, Baltes et al 1979, Kertzer & Schaie, 1989, Kohli 1978, 
Bertaux & Kohli 1984, Maddox et al 1987, Weymann, 1989, Rosenmayr 
1978, 1988, Rosenmayr & Kolland 1987, Voges 1987, Mayer & Tuma 1989, 
Clausen 1986). From the perspective of the internal intellectual dynamics of 
social science, this development may reflect, some exhaustion of Marxist 
socioeconomic approaches to social structure or of even more abstract sys- 
tems theories, and perhaps a reversal in the attention paid to the morphologi- 
cal and primordial aspects of age, sex, and territory. The resurgence of 
life-course studies is, however, also a reaction to a current societal process— 
the deep-reaching secular transformation of the alignment of the population 
process and social structure of advanced societies and the ensuing reorganiza- 
tion of the structure of the life course (Matras 1989, Pifer & Bronte, 1986, 
Kohli 1986a). 

In this renaissance of studies in the life course, the work proceeded first 
from conventional traditions of theoretical and empirical research: age differ- 
entiation (Riley et al 1972, Modell et al 1976), social and occupational 
mobility and status attainment (Featherman 1983), family demography (Bon- 
gaarts et al 1987), life cycle consumption and savings, and biographical 
research (Bertaux & Kohli 1984), human development (Clausen 1986) and 
life-span psychology (Baltes 1987). One of the characteristics of these 
approaches is that they parallel in many ways the differentiation of society 
into institutional subsystems (like the economy and the family) and of in- 
dividual life into various segmented life domains. Moreover, Hogan (1985) 
pointed out that these research divisions also tend to be gender specific: while 
status attainment and occupational careers were predominantly investigated 
for men, fertility histories and family cycles were exclusively treated for 
women. Also, the division of labor and thought within the social sciences has 
been particularly striking in the sphere of studies of the life course. Sociolo- 
gists have concentrated on age stratification and inter- and intragenerational 
social mobility, economists on life-cycle investment and consumption and 
human capital formation, demographers on fertility, and psychologists on 
“human development.” 

Interest in the role of the state in structuring the life course arose from the 
following questions: How are life courses in advanced societies shaped and 
regulated? How are the age-graded transitions between life-domains socially 
organized? How do life courses differ in contemporary societies from those in 
earlier societies? Which forces are shaping the allocation of life-time between 
life domains such as education, family activities, and employment? 

Once such questions are raised it becomes obvious that the conventional 
ways in which the life course is conceptually defined and empirically an- 
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alyzed pay little or no attention to the role of the state. While the state and its 
development figure prominently on the level of the larger society, individual 
lives are being portrayed as if they were occurring in a State-less social 
structure (Mayer & Miiller 1986:218—20). 

In terms of the state of theory and research this blind spot is even more 
noteworthy when we contemplate the longstanding interest (Marshall 1964, 
Bendix 1964, 1968, 1978, Eisenstadt & Rokkan 1973, Tilly 1975) and recent 
flurry of studies in sociology on the historical process of the rise of state 
intervention and regulation (Luhmann 1981, Offe 1984, Thomas & Meyer 
1984, Thomas et al 1987, Evans et al 1985a, b, Skocpol & Amenta 1986). 
One of the distinctive features of present-day developed societies is the extent 
to which the state, with its manifold corporate bodies and functions, has 
increased the volume of its activities and responsibilities and the way in which 
it pervades almost every aspect of social life (Boli 1980). The title “welfare 
state” would likely be the winning contender as the proper generic successor 
of the “industrial society” (against the rivals of “post-industrial” or “post- 
modern society”) (Wilensky 1976, Janowitz 1976, Flora & Heidenheimer 
1981, Flora 1979, 1988). 

One reason for this intensified sociological interest has’ been not only the 
internal contradictions of neo-Marxist theory (Lenhardt 1984, Lenhardt & 
Offe 1977, Offe 1984, Skocpol 1985), but much more the intensive debate in 
many ‘countries about alleged and harmful excesses of welfare state spending 
and intervention (Alber 1980, 1988, Mishra 1984, Eisenstadt & Ahimeir 
1985). The benefits of the welfare state, especially in regard to social security 
and equality, have in this debate been balanced against its alleged negative 
side-effects: a huge and growing financial burden for the economically active 
population (the “welfare crunch”), the concomitant growth of bureaucracy, 
and an alleged loss of individual self-responsibility and motivation. Many 
observers believe that state activity and state intervention have passed a peak 
and will decrease both in rate of growth and in absolute level (Alber 1988, 
OECD 1981, 1985 a,b, Matras 1989). But whichever direction the future 
development of the welfare state may take, state activity will have a large 
effect on shaping individual lives and the social structure of the life 
course. 

We suggest three reasons for the particular difficulties in building bridges 
between theory and research on the state and that on the life course. First, the 
meaning of the concept of “state” differs greatly in the European and in the 
North American traditions (Nettl 1968). In the United States the idea of the 
State tends to be narrowly conceived as “government” or “administration,” a 
kind of residual to the dominant civil society, and even the factual role of this 
kind of state tends to be ideologically played down. In contrast, the state in 
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Europe—and especially in France and Germany——has tended to dominate 
civil society.! 

Second, the analytical and empirical connection. between the state and the 
life course is difficult to define because the two perspectives are conceived on 
entirely different levels and within different time frames. In studies of the 
State, the nation-state is the unit of analysis, and the proper scholarship 
follows a cross-national and historical methodology. The major agents are 
other states or powerful elites within nations (Guillemard 1986a, b). Products 
of the activities of states are policies (Skocpol & Amenta 1986, Laumann & 
Knoke 1987, Quadagno 1988, Myles 1984), not people. In contrast, in 
studies of the life course the unit of analysis is the individual, and focus is on 
intra-individual change. Individual characteristics of former times are related 
to later transitions and outcomes. The major agents appear to be the in- 
dividuals themselves or conditions that can be conceptualized and measured 
on the individual level, as, for instance, conditions of the family of origin (see 
e.g. Hernandez 1986). Thus almost no direct empirical link exists between 
state organization and state activities on the one hand and the trajectories of 
individual lives on the other hand. Politics is usually conceived within the 
political system as a set of legislative measures or programs. At best it is 
conceived as being addressed to specific social groups and as having an 
empirically accountable impact on the distributions of life chances, such as 
the impact of changes in the tax system on net income. But politics is hardly 
ever conceived as intervening in the life trajectories of individuals. 

Third, among the major dimensions along which we conceptualize and 
measure the life course, there is no state dimension, and in the “states and 
activities” which define life events and transitions, the state rarely appears 


To explicate the differences in the meanings attributed to “the state” within Western Europe 
and the United States is difficult precisely because we all carry with us an already culture-bound 
understanding of the term. The best way to make this difference obvious is to examine how 
“sovereignty” is understood and how the relationship of civil servants to the state and to the public 
is defined. “Continental Europeans think of ‘sovereignty’ as residing in centralized administrative 
institutions; Britons focus on political parties in Parliament; and U.S. citizens refuse to designate 
any concrete body as sovereign, but instead attribute sovereignty to the laws and the Constitution” 
(Skocpol 1985: 22). In regard to the role of the civil service the best analysis has been provided 
by Bendix: “. . . in the United States government is considered a necessary evil” (Bendix 
1964 : 143), therefore public officials are mistrusted. It is assumed that they would primarily act 
in their own private or party interest. In contrast, German civil servants are legally bound in 
“Treuepflicht,” i.e. duty of faithful office. They are understood and understand themselves as the 
special guardian of the “Staatsinteresse,” an idea of public welfare above party politics. In line 
with this difference in the concepts of the state there seems to be a tendency in the US to focus, if 
at all, on the microlevel of very particular policies—such as the influence of welfare payments on 
employment. Jn contrast, in Europe the impact of the state on individual lives is so obvious that it 
is conceptually overlooked or relegated to applied administrative sciences (like Sozialpolitik in 
Germany and social administration in Britain). 


THE STATE AND THE LIFE COURSE 19] 


(except, e.g., for military service). Thus, again, the impact of the state on 
human lives cannot directly be observed or quantitatively analyzed. 

Therefore, an assessment of the role of the state in shaping individual lives 
is primarily an analytical and theoretical task, linking institutional arrange- 
ments and processes on the macrolevel of societies and the dynamics of life 
historiés on the individual level. The question is, how can one construct a 
conceptual framework that allows sufficiently precise inferences from state 
activities to the social structure of the life course? To the extent that such a 
framework can be developed, it will be possible to recognize. the relatively 
few instances where empirical research provides direct evidence of this 
relationship. | 

Ideally, in a systematic treatment, the structure and distribution of life 
courses should be compared between societies and historical periods of 
varying degrees and kinds of state organization and activity. Given the current 
state of the literature in the area, the goals of this review are much more 
modest. Empirical instances from various countries and historical periods will 
be used in a selective and illustrative manner. In regard to the conceptual 
framework we build on the few available treatments of the subject (Mayer & 
Müller 1986, 1989, Buchmann 1989, Thomas & Meyer et al 1987, Kohli 
1985, 1986a). 

Specifically, we are searching for answers to the following questions: What 
relationship exists between the development of early statehood, the modern 
welfare state, and “developed” structures of the life course? What are the 
specific modes and mechanisms by which state activities affect the life 
course, and how can these be conceptually reconstructed to demonstrate their 
impact? How do collective actions by the state change the logic of individual 
action in regard to life-course decisions? How does the activity of the state as 
a provider of transfer incomes and as a provider of jobs make its impact on the 
shape and distribution of life courses? 


THE DEVELOPMENT OF THE MODERN STATE 


Although forms of political organizations can be said to exist for all known 
societies in history, the modern state is a development distinctive both in its 
origin as a transformation of occidental feudalism and in its emergent struc- 
ture (Weber 1968, Hintze 1962, Eisenstadt 1980). The development of the 
modern nation-state is particular in the sense that it implied the evolution of a 
political corporate body on the basis of rational-legal construction, rational 
law, and a bureaucracy acting according to impartial laws and regulations. To 
understand the bearing of this specific development on the structure of the life 
course, it is useful to sketch in this section: (a) the evolution of state 
organization and state activities; (b) its impact on the rise of the individual as 
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the unit of action;? and (c) its role within the process of institutional differenti- 
ation of societies. In the following section we then trace the impact of the state 
on structural features of the life course. 


The Evolution of State Organization and State Activities 


Centralized nation-states are initially the products of attempts by rulers to 
consolidate and enlarge a territory and to establish central authority in a 
struggle against internal and external powers (Bendix 1978). In this process, 
such rulers aimed at monopolizing the means of power in the form of a 
standing army and—often as a means to provide for the army—in the form of 
an effective national system of tax collection. The relative success in es- 
tablishing centralized power was dependent on a parallel process in which 
dominant cultural symbols (religion, language, nationhood) were spread over 
the territory and were enforced against rival cultures represented by the 
church or regional cultures (Rokkan 1975). 

Political power was fostered internally by the introduction of universal 
rational rules for the exercise of authority, in the two forms of (a) generally 
applicable codes of penal and civil law, and (b) administrative bureaucracies 
composed of salaried professionals. Moreover, the political system differenti- 
ated itself from feudal and, later, mere class rule, and—with gradually 
increasing electoral participation—into the divided powers of government, 
judiciary, parliament, and party representation. 

For the purposes of this discussion, however, it is important to stress the 
ways in which the state reached beyond a narrowly conceived political sphere 
into the other domains of society (Marshall 1964, Luhmann 1981 :25-32). 
The state increasingly took over responsibilities for external and internal 
conditions of the economy (trade, transportation, communication, labor 
contract, freedom of coalitions) as well as responsibilities for mobiliz- 
ing resources for economic development that employed the law (labor) 
and the tax system (capital) and that made elementary schooling obligatory 
(skills). 

Furthermore, the power relations within society underwent a near total 
change (Coleman 1974, 1982, Elias 1978). Whereas, formerly, power was 
almost exclusively vested with heads of households or mostly local col- 
lectivities such as feudal families, cities, or church bodies, now the individual 
as a citizen and as an economic actor held power aggregated and mediated by 
national associations according to economic interests. Unions and mass par- 
ties formed the new intermediary powers between the citizen and the state. 

The state has not only evolved a tendency to control, substitute, and 
supplement markets, but willingly or unwillingly it has taken over the welfare 


*We are relying here especially on Flora (1976, 1981) and Breuer et al (1982). 
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responsibilities of the older associations and communities it had weakened or 
destroyed. Furthermore within the last 100 years the nation-state has taken 
over the responsibilities of social security and welfare for life risks and 
welfare (disability, unemployment, sickness, old age, medical care, housing, 
education). In the studies of Alber (1982), Flora et al (1977, 1983, 1987) and 
Flora (1986-1988), this gradual expansion has been systematically: described 
for the Western European welfare states. On the theoretical level the same 
process has been labelled one of political inclusion by Luhmann (1981 :25 ff). 
The welfare state has a tendency to become universal. Programs started for 
small and restricted groups at risk are gradually generalized to the whole 
population (see also Flora et al 1983). Moreover, there is no limit to the kinds 
of problems that might be taken up for social policy intervention. As a result, 
the scope and coverage in the provision of transfer incomes and services have 
greatly expanded, accompanied by a dramatic increase of employment in the 
public sector and the rise of a “service” class of administrative personnel and 
professional “caretakers” (Rose 1986).? 


The Social and Legal Construction of the Individual 


A first relationship between modern-state formation and the individual life 
course is that the former successively created conditions that single out the 
individual both as the object of state activity and as a distinctive, self-reliant 
actor. In destroying and weakening traditional associations and communal 
powers, and by centralizing authority, individuals begin to distinguish them- 
selves from their collectivities. Rights and duties are transferred from col- 
lective bodies to the individual (Coleman 1974, 1982, 1986). At first this 
transferral only pertained to the (male) heads of households, whereas current- 
ly more and more rights are transferred to children and women as well (Boli 
1987). Legal rationality establishes the individual as the prime holder of rights 
and duties and as the prime target of bureaucratic and administrative acts. In 
comparison with property revenue or subsistence farming, labor income goes 
to individual wage earners and their nuclear families, in contrast to larger 
solidary groupings. In addition, as transfer incomes benefit e.g. pupils, 
university students, wives, or live-in grandparents, rather than the head of 
household, individuals may opt to leave households or families or not es- 
tablish them in the first place (Hannan et al 1977, Bane 1983, Schultheis 


3To be sure, there are large differences between countries in their particular path of state- 
building (Tilly 1975), the degree to which they evolved toward a comprehensive welfare state, 
and its specific institutionalization (for Western Europe, see Flora 1986-1988). There are also 
apparent reversals, as in Britain (Parry 1987) and substitution arrangements, like more developed 
firm policies in the US (Kohli et al 1989), or the delegation of welfare function to the churches 
and other charitable organizations in the Federal Republic of Germany. But the overall process is 
so pervasive that—as a first step—it may be defensible to treat the life-course outcomes of this 
type of society as a generic problem and not one particular to this or that country. 


194 MAYER & SCHOEPFLIN 


1987, Van Stolk & Wouters 1984). Social security systems covering major 
life risks decrease the individual’s dependency on others and permit in- 
dividual life plans on the basis of stable expectations of income and need 
provision. Furthermore, national social insurance systems greatly enhance the 
individual’s ability to change places of work and residence, and this increases 
the likelihood of individual mobility. 

In regard to this linkage between state formation and the emergence of the 
individual actor there are two bodies of literature that deserve special men- 
tion. One is the “theory of the civilizing process” by Norbert Elias (1978).4 
Elias develops a theory of a process linking the development of the state with 
a particular social character. He postulates that a major consequence of the 
monopolization of power by a central state was a transferral of external social 
control to internal self-discipline. Physical constraint through others was 
replaced by internalized self-restraint of affects and behavior. Empirically 
Elias demonstrated his thesis by examining, among other things, the elabora- 
tion of eating and bedside manners, and of courting rituals, and the repression 
of aggressive behavior. Elias’ theory covers the historical period between the 
late middle age and the early renaissance (thirteenth to sixteenth century), but 
obviously has implications of a more general nature.’ 

The second body of literature stems from the so-called cultural or in- 
stitutional school of John Meyer and his associates (Boli-Benett & Meyer 
1978, Meyer 1986; Thomas et al 1987). These authors propose a noneconom- 
ic version of modernization theory. They conceive both the modern nation- 
state and the individual as social-cultural constructs, as “projects” of a 
rationalization process of world-wide diffusion: 


“The rise of the nation-state subordinates not only the world exchange system, but also the 
status of the individual . . . The individual loses validity as an autonomous effectual actor . 
in the system, but does not necessarily lose value. For example, individual rights are 
expanded, but are not primary, for they are derived from individual membership in the 
state. Modern state accounting has individuals as:human capital for societal purpose. 


4Elias’ study on the civilizing process was written in 1936 and first published in German in 
1939, but has received increasing.attention only in recent years. The English language edition 
was published i in 1978. 

5There have been a number of attempts to apply Elias’ method of * ‘configurational analysis” t 
current developments (Gleichmann et al 1977, 1984); but neither Elias nor his growing number of 
followers have been able to apply the theory in a more than sketchy manner to contemporary 
conditions. Von Ferber (1984) comes closest to this task when he parallels the’late medieval 
process of the monopolization of physical force by the central state and the psychogenesis of 
shame with the capitalist monopolization of the means of production and the rise of the bourgeois 
family, the monopolization of welfare provision for sickness and old age with the rise of ‘the 
nuclear family in the proletariat, and finally, when he parallels the monopolization of occupation- 
al career. opportunities. via the educational system with the corresponding emancipation of women 
from the family. Gleichmann et al (1984) contains an extensive Pibiography on the works of 
Elias and the reception of his theory. 
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Like other objects of value, they may receive even more protection than in classic 
individualism” (Meyer 1987 : 67). 


Meyer and his associates succeed in constructing very plausible in- 
terrelationships between a world-system “project” of the nation-state on the 
one hand, and the construct of a welfare-dependent individual, the extension 
of human rights, and the expansion of general education and of childhood on 
the other hand. Their work is important in following through the logic of 
Bendix (1964), Flora (1976), Parsons (1970), and Marshall (1964) by extend- 
ing citizenship rights to economic, social, and cultural rights. This extension 
of the definition of rights as entitlements provides the basis for claims for 
equality, the political definition of ever new inequalities, and thus legitimiz- 
ing welfare state intervention (see also Luhmann 1981). But this recent body 
of literature is probably more important because it stresses that the state is 
more than its administrative bodies; it is—just as is the individual—a cultural 
construct of considerable impact. It is noteworthy, however, that on such a 
level of generality the state-linkage does not appear at all in Meyer’s piece on 
the “self and the structure of the life course” (Meyer 1986). 


Institutional Differentiation and Integration 


A second relationship between modern state formation and the individual life 
course must be seen within the context of societal differentiation (Mayer & 
Müller 1986:222, Mayer 1986). The traditional household was the primary 
locus of production and consumption, of reproduction and family life, of 
socialization and training, and of political authority (Weber 1968). The legal 
separation between the household and the firm, and the functional division 
between the family and the school, directly translate themselves into seg- 
mented roles of the individual, who ceases to be exclusively a member of a 
primary group. This differentiation is the precondition of the structure of the 
life course in terms of variable participation in segmented roles over the life 
time. This process may be termed the mapping of the institutional differentia- 
tion on the macrolevel of societies onto diachronically ordered segments of 
the life course (Luckmann 1975). 

This is probably the most common way of thinking about the social 
structure of the life course. Smelser & Halpern (1978) developed this idea as 
the triangularization of life between school, family, and work. To the extent 
that participation in these various domains is organized in a strictly sequential 
manner across the life time, there is then a wpe way of depicting correspond- 
ing social stages of life (Buchmann 1989).° Along these lines Kohli (1985, 
1986a)—and much earlier Albers (1973)—has developed the concept of the 


One appeal of this conceptualization of the structure of the life course is that the allocation of 
total life years between these domains or main activities can be measured and graphically 
displayed in. life time ‘budgets (Modell et al 1976, Blossfeld 1987, Imhof 1981). 
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tripartitioning of life; the three major life stages are organized around gainful 
employment: the phase of education and training as preparation for work, the 
phase of active employment, and the postretirement years. He thus sees the 
social organization of labor as the prime determinant of the structuring of life 
courses in modern societies. In their qualitative biographical studies on the 
transition to retirement, Kohli & Wolf (1988) have assembled considerable 
evidence that this macrostructure is also constitutive of the subjective mean- 
ings of the individual actors. In biographical accounts the pre- and post- 
employment years tend to derive their subjective meaning from the work 
phase. 

The secular transformation of life courses is from a situation where the 
agrarian household economy imposed a much less stringent and predictable 
scheduling of the life time to a situation where an ever greater proportion of a 
population flows through these stages. This transformation has also been 
highlighted by Tamara Hareven’s distinction between “family time” and 
“industrial time” (Hareven 1977, 1982; see also Anderson 1985). The ex- 
igencies of a preindustrial family’s economic self-sufficiency under con- 
ditions of highly fluctuating yields and high mortality prompted family 
members to accommodate their personal life plans to the needs of the house- 
hold. Such accommodations are seen clearly in such historical patterns as the 
transition to adulthood (attending school, leaving home, marrying) (Modell et 
al 1976, Ankarloo 1978). During the early stages of industrialization—the 
economic survival of the family depended on a joint strategy that sent women 
to work, withdrew children from work, or aided the migration and placement 
of kin by the changing economic circumstances of the family (Featherman 
1989, Mayer & Miiller 1989). 

Gradually, the correspondence between family time and industrial time was 
replaced by one between “industrial time” and “individual time.” The tempo 
and scheduling of the individual life course could be based more on individual 
considerations and less on the family unit. At the same time, the scheduling of 
life was increasingly determined by a new set of institutions, outside the 
family, that determined the sequence of roles into and through which persons 
passed as well as their respective durations (Featherman 1989). 

The early interpretation by Hareven and the later one by Kohli coincide in 
the view that more modern life courses are driven by the market economy. 
Featherman (1989), in contrast, challenges the implied thesis that the 
restructuring of life courses followed a straightforward “logic of in- 
dustrialism”; he develops a much more complex argument introducing pop- 
ulation growth and labor supply, popular theories about human development, 
problems of social control of youth, and the substitution of lineage by 
age-grading or intergenerational transmission by cohort succession. 

But what is the role of the state in this process of institutional differentiation 
and its mapping onto the life course? On the level of the society, the 
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development of the institutions of the modern state has been interpreted as the 
main mechanism for integrating and regulating the divided institutional do- 
mains of the family and the economy (Flora 1976). In an analogous manner, 
one should, on the level of individual life histories, look not so much for 
“political” roles or “political segments” of life time, but for state influences on 
the regulation of the transition between domains and on the integration of 
sequential social roles. Thus, for instance, the legal framework of the labor 
contract or the laws against child labôr (Smelser 1959) accentuated the 
cleavage between family and work, as well as the cleavage between the work 
life and periods before and after the work life. The introduction of compulsory 
schooling and optional further education is the most telling instance of state 
construction of the life course in its role in reacting to latent consequences of 
institutional differentiation. The state often intervened to make up for the 
weakened ability of families to socialize, train, and control children. This was 
especially pressing in the periods of early industrialization when children had 
finally been excluded from work but mothers were still drawn into it (Smelser 
& Halpern 1978). Recent studies of social historians demonstrate clearly that 
for the American case the introduction of education was highly contingent on 
the demand for labor and problems of controlling idle youth (Kaestle & 
Vinovskis 1978, Katz & Davey 1978), rather than primarily motivated as a 
period of preparation for work. The historical case was, however, very 
different in Germany where the interests of the state were the motive of 
introducing and expanding education—long before industrialization got under 
way (Leschinsky & Roeder 1976, von Friedeburg 1986). 

_ This brief review emphasizes the historical variability and causal hetero- 
geneity of state-induced structural conditions for the individual life course. 
Some of these conditions are latent consequences of the earlier establishment 
of the centralized nation-states as legal-bureaucratic institutions. Some are 
unintentional effects of state intervention in fostering industrialization and 
economic development, and some are results of purposive action to increase 
individual security and to cure the “ills of capitalism” (von Ferber 1976). 
Finally, there are the intentional and unintentional results of increasing mate- 
rial and nonmaterial welfare through income distribution and the provision of 
service in the fully developed forms of the welfare state. It is clearly in- 
sufficient and hardly promising for research merely to juxtapose a macrocon- 
struct “state” with a similarly generalized macroconstruct “individual” or “life 
course.” We therefore now take a closer look at the major mechanisms by 
which activities of the state affect the structure of the life course. 


MODES OF STATE ACTION AND THEIR IMPACT ON 
THE STRUCTURE OF THE LIFE COURSE 


Everyday observation shows an abundance of the concerns of the modern 
state with particularities of different phases of the life course. The state 
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provides nurseries for small children, designs schools for older children, sets 
rules for motherhood protection, legislates retirement rules, and provides care 
forthe elderly. But we hardly find manifest instances of explicit policy and 
state action relating to human life as a whole or even conceiving its: policies as 
having later life impacts (Hirschhorn 1977, Brim & Phillips 1988). In tracing 
state effects on the life course one therefore has to ask: How do life course 
structures change when the state rather than primary social groupings, is in 
charge of providing for basic-needs and security, and of regulating most 
spheres of society? What are the formal properties that restrict the action 
repertoire of the state and produce a specific selectivity in the way problems 
are dealt with? These properties result from the instruments of action available 
to the state (Mayntz 1979), the rules for legitimate use of them, and the 
organizational apparatus employed. 


Regulative Norms and Age Stratification 


Evidence for state effects on the life course was collected in a conventional 
way that relates to age rules in legislation, e.g. in regard to age at school 
entry, prohibition of child labor, legal marriage age, and the legal retirement 
age (Cain 1976, Eglit 1985, Neugarten 1982, Riley et al 1972). There is no 
doubt that such age norms had and have considerable impact. Yet, the 
available evidence shows that their importance may have been greatly over- 
estimated. Age-related legal prescriptions per se are frequently more apparent 
than real (Mayer & Müller 1986: 226-30). For instance, in Germany it took 
almost 200 years before compulsory schooling ages were actually followed 
throughout the population (Leschinksy & Roeder 1976). It has been shown. 
that legal ages at retirement followed rather than shaped social practices 
(Stearns 1977, van Tassel & Stearns 1986) and rarely coincided with actual 
age at retirement. Especially within the last decades in many Western coun- 
tries a rapid fall occurred in the actual age at retirement, and only slowly 
following legal age norms. In the United States, age at retirement tends to fall 
even despite legislation to the contrary (Guillemard 1985, 1986 a,b, Schmähl 
1988, Jacobs et al 1987, 1989, Kohli et al 1989). As Mayer & Müller 
(1986: 229) have argued, in the context of the introduction of the German old 
age pension system, legal age norms are only effective if they are combined 
with other means, e.g. sufficiently high pensions, or the provision of services 
as in: schools. l 

However, there.is some evidence for the temporary or pervasive influence 
of legal age norms. Papastefanou (1988) shows that the age at marriage for 
women fell significantly when the legal age for adulthood was lowered from 
age 21 to age 18 (in the year 1972), although the legal age for marriage of 
women was unchanged at age 16. Blossfeld (1987, 1988) gives remarkable 
evidence that the quasi-legal age structure of entries into and exits from 
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schools was virtually unaffected by the end of World War II and its aftermath. 
Age norms persisted, although the state order had actually collapsed. 

The current legal debate about age discrimination (National Conference 
1981, Eglit 1985) marshals evidence both for the felt impact of legal age 
norms and an awareness of their inappropriateness. However, like the discus- 
sion about sex discrimination, it probably accentuates rather than weakens 
distinctions based on age. 


hars 


Entitlements and ihe Standardization of Life Events 


One of the main ways in which the state affects the life course is that legal 
rules define universal and not case-wise applicability and entitlements. Uni- 
versalism is at the core of legitimation for state activities. Whatever action the 
modern state is taking, it has to be governed by or based on general laws or 
administrative decrees, uniformly applicable. for all citizens fulfilling certain 
criteria (von Ferber 1979). This marks the transition from the liberal to. the 
welfare state. Formal rules of eligibility require rigid definitions of “life 
situations.” The risks covered by social insurance legislation are being treated 
in an all-or-nothing manner: one is either employed or out of the labor force, 
one is either healthy or on sick leave, one is either in the period of maternity 
protection and not working or is fully employable. 

In contrast, in prewelfare societies transitions between being a child and 
working, between working and being retired, between being ill and a healthy 
worker were very: gradual. There were only few sharp life transitions. As 
Schröter (1984) has shown, until-the late Middle Ages even the transitions 
between marriage and nonmarriage were often not clearly marked. In the 
welfare state, then, the continuous flow of life is transformed into a series of 
situations all of which have a clear formal definition. At any given time in 
life, a person has a welfare status which defines personal obligations and 
rights with respect to the different institutions, most notably as either payer or 
receiver. The periodization of life and the proliferation of sharp transitions 
which derive from the social insurance systems combine with those deriving 
from general schooling and vocational training into a life-long biographical 
pattern.. Moreover, the tendency of welfare state to cover whole populations 
ensures.a society-wide universalization of welfare state—type life-course pat- 
terns (Mayer & Miller 1986: 235). 


Bureaucratic Administration and Service Bracketing 


The welfare state transformed societies from systems in which individuals and 
families responded in a particularistic and gradual manner to situations of 
need into a system of entitlements governed by general rules (Hareven 1986a, 
b). But this is only one side of the replacement of the family and primary 
groupings by the state. The other relates to the organization and daily im- 
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plementation of the welfare state. As in any large-scale organization, division 
of labor and well-defined areas of competencies are basic characteristics of 
the state. Therefore, if provided by the state, the care for the needs and risks 
of life has to be segmented into specific organizations, competent and autho- 
rized for the various types of needs and risks. In this way, a segmentation of 
offices as well as a segmentation of professionals and clienteles is brought 
about. The transition in more and more domains of life from laymen provid- 
ing comprehensive care to professionally authorized caretakers and special- 
ized service personnel demonstrates this ongoing process (Mayer & Müller 
1986:234-235). 

Age groups are a favorite way of dividing up social problems, for pro- 
fessionals and clienteles, because their boundaries are administratively easy to 
define. Important contributions in this area are the recent studies by Heyns 
(1987, 1988) on the increasing specialization of medical doctors, nurses, and 
other professionals in regard to childhood (for old age, see Haber 1986). 
Heyns documents not only this segmentation but also the parallel between the 
social hierarchy of the professionals and the layers of age stratification. The 
prestige of geriatricians and pediatricians tends to be lower than, e.g., that of 
surgeons, and the proportion of women tends to be higher in the lower 
prestige groups. 

This configuration of service bracketing for age homogeneous “target 
groups,” specialized professionals, and age-specific criteria for financial 
support for these services is reinforced by strong beliefs in the existence of 
important psychological, social, and organic differences between age groups 
(or vice versa). 


DIRECT EFFECTS: PUBLIC EMPLOYMENT, MILITARY 
SERVICE, AND WARS 


Up to new we have examined the latent and unintended consequences of state 
activities on the institutional structure of the life course, but there are obvious- 
ly also quite direct effects. One major effect must be seen in the rapid increase 
of public employment (Rose 1986). Although the distinctive life chances and 
habitus of civil servants is a long-standing topic in sociology, the differences 
between public and private employment are probably diminishing (Mayer & 
Müller 1986 : 238-42). Recent studies on career and income trajectories in the 
private and public sectors demonstrate remaining differences (Visher 1984, 
Carroll & Mayer 1986).. : 
States intervene most drastically in the lives of their citizens when they 
declare war. In so doing they interrupt the life course by drafting large 
proportions ‘of cohorts into the military, leaving: many of them disabled and 
traumatized for the rest of their lives. Beyond that, the lives of the civilian 
population are threatened in their material existence and exposed to disloca- 
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tion and disruption. Recently the effects of military service and war experi- 
ences have been examined in regard to their longer term impacts on the life 
course (Marini et al-1987:88, Hogan 1981). Mayer (1988a), Elder & Bailey 
1986 and Elder & Meguro 1987 have shown for the US, German, and 
Japanese samples that the later disruptive effects of military service and entry 
into schooling or the labor market during war times depend much less on the 
length of service, than on whether such deprivations occur at crucial times in 
the educational and occupational careers of citizens. Hoerning (1987) reports 
in a qualitative study on a case of state intervention in the “Cold War,” on 
life-course effects brought about by the building of the Berlin Wall. In a 
similar manner she documents the impacts that vary with prior resources and 
age or career stage at the occurrence of the event. In most of these studies, the 
theoretical model on the differential impact of early deprivation, developed by 
Elder (1974) in his Children of the Great Depression, has been putty and 
successfully applied. 


FUNCTIONAL RATIONALITY AND THE LOGIC OF 
LIFE DESIGNS 


A particular theoretical and empirical puzzle arises from the following ques- 
tions: Which subjective life plans and biographical reviews prevail under the 
conditions of differentiated life courses? What impact has the state on the 
level of “the vocabulary of motives”? Both the extension of the life span and 
the aversion of life risks through the insurance systems of the welfare state 
should foster stable expectations, long-term calculability, and deferred 
gratification patterns. The more marked changes and transitions in life should 
increase biographical awareness (Bertaux & Kohli 1984, Kohli 1986c). In 
contrast, it has been argued that: 


Given the relative independence and heterogeneity of institutionally defined logics of 
action, it seems unlikely that individuals are able to form comprehensive and long-term life 
designs. It seems also unlikely that individuals would be able to stick to life plans 
developed at the outset of adult life and to carry them through successfully. Because the 
linkages between and within “careers” in different institutional domains are highly pre- 
determined, there seems to be structural support for a logic of the present situation rather 
than a logic of a persistent individual life design, according to Luckmann (1975): ‘Large 
segments of the institutional order that directly determine much of social interaction in our 
everyday lives became increasingly disengaged from the biographical context of meaning 
in which most social interaction was traditionally located . . . An individual’s ‘social’ 
existence came to. consist, in considerable part, of a series of performances of specialized 
and anonymous social roles. In such performances the person and the biographical contexts 
of meaning tended to become irrelevant (p. 11). l 


(Mayer & Müller 1986: 235-38, 242; Mayer 1988b). 
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Accordingly Meyer (1986) has postulated the segmented selves corresponding 
to the segmented life course, and Schimank (1988) has coined the term 
“biographical incrementalism” for this new logic of the present situation. 

On the specific influence of welfare state intervention Mayer & Müller 
(1986) developed the theory that the legalistic and monetary nature of welfare 
provisions fosters “functional rationality” as opposed to “substantive rational- 
ity” (a distinction made by Mannheim 1940:53/56). Wherever the state 
establishes rules, provides services, or offers monetary incentives, it is func- 
tionally rational for individuals to make use of such opportunities: This may 
lead to decisions concerning the individual organization of the life course 
which might be discrepant from actual individual needs or initial life designs. 
Mayer & Müller (1986) give only illustrative examples supporting this argu- 
ment, but corroborating evidence can be found in Van Stolk & Wouters 
(1984) and Schultheis (1987). Schultheis studied a program encouraging 
unwed pregnant women to bear their children by providing financial assis- 
tance for a limited period of time (4 years). The financial aid stipulated 
staying at home with the child and prohibited cohabitation with partners. He 
concludes that the program led to passivity, a reduction of long-term plan- 
ning, and reduced investment in education, and thus had disastrous con- 
sequences for the later life chances of these women.’ 


UNRESOLVED PROBLEMS 


The first question we should raise at the end of this review concerns the 
specificity of the impact of the state on modern-day life courses: Are the 
influences of the state just one other expression of the bureaucratization 
process? Here we should point out that the state is certainly only one of the 
agencies which impose order and regularity on life courses. But besides the 
effect of the modern corporation for those who work in big firms, the impact 
of the state is certainly the most powerful one. It is probably the most 
pervasive, because its impacts tend to become universally applicable for total 
populations, as—for instance—-social security, retirement legislation, or 
maternity leave. Thus, the state is providing a general framework for in- 
dividuals progressing through their lives. 


7For a dissenting view in regard to the quality of care for children and the elderly with 
increasing state involvement, see Bane (1983). : 

Theories about the reconstruction of motives under the impact of the welfare state can also be 
found in Luhmann (1981) and in Habermas thesis of the “colonialization of life-worlds” by the 
systemic institutions of modern societies (Habermas 1984). In this context one might also refer to 
the controversial debate and research findings in labor economics about the impact of unemploy- 
ment benefits on the labor supply (Sørensen 1989, Hujer & Schneider 1989) and in demography 
about the effectiveness of financial benefits in increasing fertility (Dinkel 1984, Schubnell 1975, 
Rein 1983). 
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A second question concerns the balance between the integrative and 
segmenting influences of the state on the life course. As persons proceed 
through their lives, the state defines most of the ports of entry and exit, into 
and out of employment, into and out of marital states, into and out of 
sickness- and disability leave, into and out of education and vocational 
training. In doing so, the state turns these transitions into public life events 
and interlocks organizationally and functionally differentiated domains of 
society as people flow through them. Thus, it seems that the state contributes 
to the constitution of continuous and coherent lives, not only biologically, but 
also socially. However, in contrast to these integrating tendencies, we have 
seen that in this very process the state accentuates transition events, age 
brackets, and life segments, and provides external rather than individually 
created order throughout life. In this way the state clearly contributes to a 
differentiated structure of the life course. By increasing individual security 
and by lessening dependency on others, the social bond is weakened and 
individual lives are separated from the collectivities of family and kin. People 
can take the risk of changing employment, divorcing partners, or migrating. 
By removing economic restraints, individual mobility options increase. 
Moreover, the welfare state provides continuity over the life course by 
preventing sudden and steep income losses through health and unemployment 
insurance, and by redistributing income over the life time through old age 
insurance. Also, the welfare state provides for people working in the public 
sector specific education-occupation linkages, stable employment, orderly 
career lines with secure and progressing income. 

Whether the welfare state produces segmentation within the life course or 
provides integration and continuity, there is no clear answer. While both the 
integrating and differentiating influences of the state are considerably 
structuring the life course, we have argued that the incentive systems of the 
welfare state are weakening the likelihood of integrated life courses at the 
level of the individual biography. But analytical concepts need to be formu- 
lated to characterize these seemingly contradictory influences on the life 
course of individuals. 

We suggest empirical research to progress in three areas: First, in- 
tranational, microanalytical studies on the impact of particular state in- 
terventions and their consequences on life-course decisions and outcomes. 
Examples are the effects of unemployment insurance on labor market exit and 
reentry, of child allowances on fertility, of alimony payments on divorce, of 
tax advantages on marriage, and labor force participation of women, of 
changes in old age pensions on early retirement. A number of such studies 
have been reviewed here, but systematic evidence is still scanty and on the 
whole not conclusive. Cumulative evidence over longer periods of the life 
span would be helpful for assessing the long-term effects of state regulations 
on individual outcomes. 
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Second, cross-national comparative research is needed to understand how 
different policy systems pattern life courses. Furthermore, only a comparative 
perspective allows us to isolate the effect of a single policy act on life 
changes. Variations in life course structure between countries should closely 
match their differences in degree and kind of social policies. For example: 
What is the variability of life outcomes under conditions of a fully developed 
welfare-state, composed with a less developed one? What does the aging 
process look like in a society with generous old age pensions, compared with 
a society with small and inadequate. pensions? How do -political decisions 
affect occupational choices and careers of women? Such research questions 
can be addressed by exploiting cross-national data banks on nationally repre- 
sentative, individual level micro-data, e.g. the Luxemburg Income Study 
(Buhmann et al 1988), the available life-history, e.g. from Germany, Nor- 
way, and the United States (Allmendinger 1988), and panel studies, e. g- 
Elder 1985a, b, Krupp & Hanefeld 1988. 

Finally not only should the impact of social policies on individual outcomes 
be investigated, but also, on a more general level, attention must be paid to 
how social policy acts are shaped by individuals. The micro-macro path could 
be studied with regard to women’s labor market participation and changing 
occupational segregation, or to expanded schooling and tightened selection 
criteria for career entry. By integrating the level of individual action to 
macro-social functioning the study of the full relationship between the state 
and the life course is made possible and would link theoretical issues with 
empirical research. 
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Abstract 


The immigrants to the United States since 1965 are overwhelmingly an urban 
‘population; they have converged on a. small number of large metropolitan 
areas. This article describes the characteristics of the new immigration and its 
geography. It then focusses on the key immigrant-receiving metropolitan 
areas and discusses the relationship between ‘the restructuring of their 
economies and land markets and the employment and settlement patterns of 
the new immigrants. 


INTRODUCTION 


Though the study of cities was largely the study of immigrants during the 
formative years of American sociology,- this is.certainly no longer the case 
today. The interested reader would search in vain through back issues of the 
‘Urban Affairs Quarterly for studies of the new immigrants who have settled in 
America’s cities; and the related journals supported by our sister disciplines 
are equally prone to give immigration research short shrift. Immigration 
specialists, on the other hand, are no more attentive to urban issues. In a 
sense, much of the sociological research on the new immigration to the United 
States is about people who just happen to live in cities; but how the particular 
characteristics of the immigrant-receiving areas impinge on the newcomers is 
a question immigration researchers rarély raise. 

What accounts for this inattention to the urban‘aspects of immigration is not 
entirely clear. Whatever the causes, there’can be no question that immigration 
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is a fundamentally urban phenomenon. New immigrants—those newcomers 
who have immigrated to the United States since 1965—are disproportionately 
concentrated in central cities, and they are overwhelmingly a metropolitan 
population. Moreover, the new immigrant stream has converged on Amer- 
ica’s largest urban places: in 1980, four of every ten new immigrants lived 
either in the greater Los Angeles or in the New York area..Elsewhere, as of 
the 1980 census, only a scattering of large cities could count substantial 
foreign-born populations. 

This essay is concerned with the icanon between the new immigration 
and the changing economic and ecological characteristics of the cities in 
which they have settled. Given the extraordinarily localized dimension of the 
new immigration, only a handful of urban areas receive attention. These 
areas—New York, Los Angeles, Chicago, San Francisco, and Miami—are 
selected for two reasons: they comprise the largest concentrations of the new 
immigrant population (containing, together, 46% of all 1965-1980 immi- 
grants counted in the 1980 Census); and they occupy similar positions in the 
hierarchy of cities.! 


IMMIGRATION TRENDS 


The new immigration to the United States began with the passage of the 
Hart-Cellar Act in 1965, which abolished the old country of origins quota, 
affirmed family connections as the principal basis for admission to permanent 
residence in the United States, and increased the total numbers of immigrants 
to be admitted to the United States (Reimers 1985). Despite a number of 
changes, this system has essentially remained in place to this day. The major, 
entirely unanticipated, consequence of the Hart-Cellar Act was the dramatic 
increase in immigration from Asia, which has become the largest regional 
source of legal immigrants. While arrivals from Europe have fallen off 
sharply over the past 20 years, immigration from the Caribbean and Latin 
America has been on the upswing. The size of the legal immigration flow has 
also increased. Between 1966 and 1970, an average of 374,000 newcomers 
entered the country each year; between 1982 and 1986, by contrast, annual 
inflows averaged 575,000.” 

In addition to the legal immigrant flow have come substantial numbers of 
undocumented immigrants-—people who either cross the borders illegally, or 
enter the United States legally but extend their residence beyond their legally 


‘Given space constraints, and the concern with both immigration and urban change, justice 
cannot be done to the diversity of the new immigrant population. In general, greater attention is 
paid to the various groups of Hispanic immigrants; this essay does not attempt to treat the 
experience of new Asian immigrants, for which a separate review article might well be appropri- 
ate. 

*Calculated from US Department of Commerce, 1987, Table 1. 
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permitted stay. Just how many undocumented immigrants have been living in 
the United States has been a matter of controversy for over two decades. For 
many years, wildly disparate, often politically inspired estimates and guessti- 
mates have been current, with upper and lower bounds at 12 and 2 million, 
respectively. More recently, a consensus appears to have settled on a 2 to 4 
million range (Bean & Tienda 1987). Warren & Passel (1987) estimated that 
approximately 2 million undocumented persons were counted by the census in 
1980. Although the 2 million figure is probably a lower range, it does not 
appear likely that a large proportion of the undocumented was left uncounted 
in the census. The Warren & Passel estimate suggests that Mexicans com- 
prised over half of the undocumented population present in 1980. 

Post-1980 estimates have been developed using the 1979, 1983, and 1986 
Current Population Surveys. These estimates (Passel & Woodrow 1987, 
Woodrow et al 1987) point to modest growth in the undocumented popula- 
tion, with annual net inflows of about 100,000 to 300,000 persons, yielding a 
1986 undocumented population of a little over 3 million. The national origins 
of this population appear little changed from 1980, with Mexicans and other 
Latin Americans heavily predominant. 

The most important piece of immigration legislation since the Hart-Cellar 
Act was the Immigration Reform and Control Act of 1986, which provided an 
amnesty for undocumented immigrants who had resided continuously in the 
United States since January 1, 1982, and imposed sanctions against the 
employers of illegal immigrants.? The bill established a 12-month period, 
from May 4, 1987 to May 5, 1988, during which eligible undocumented 
immigrants could apply for legalization. As of December 1988, close to 1.8 
million applications by pre-1982 undocumented immigrants had been filed 
(US INS 1988), indicating that a sizeable portion of the estimated 2.5 to 3.5 
million undocumented immigrants counted in the 1980 Census did not avail 
themselves of the legalization opportunity. For reasons that are not clear, 
Mexicans are considerably overrepresented among the pre-1982 legalizees (a 
group that excludes immigrants eligible under a special provision for agricul- 
tural workers), more so than they are among the counted undocumented 
population (US INS 1988) 


THE GEOGRAPHY OF THE NEW IMMIGRATION 


As in the past, immigration has been disproportionately urban-bound. In 
1980, 40% of the new immigrant population lived in the SCSA of either New 
York or Los Angeles (with about 20% in each area); thereafter, concentrations 


>There are, in fact, three additional subgroups for whom the legalization criteria and deadlines 
differ from the main, pre-1982 group. The most numerous of the subgroups are undocumented 
immigrants who can show that they worked in agriculture for at least 90 days during 1984, 1985, 
and 1986. See Meisner & Papademetriou 1988. 
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dropped sharply. 1980 Census data for all SMSAs and for the ten metropoli- 
tan areas with the largest new immigrant populations are displayed in Table 1. 
The table underscores several facets of the new geography of immigration: 
The recent arrivals are overwhelmingly to be found in metropolitan areas, 
where they live, to a disproportionate extent, in central cities; the newcomers 
are even more overrepresented in the ten principal receiving areas; only Los 
Angeles and New York contain sizeable proportions of the total immigrant 
population; and only in occasional instances do immigrants significantly alter 
the demography of an area. Indeed, El Paso, Honolulu, and Jersey City, were 
the only other areas among the 100 largest SMSAs to contain new immigrant 
populations exceeding 10%.* | 

These leading SMSAs vary in the diversity and composition of their new 
immigrant populations. Of the five main receiving areas, three are dominated 
each by a single origin group: Mexicans in Los Angeles and: Chicago, and 
Cubans in Miami. These comprise 47%, 32%, and 59% respectively of 
1965-1980 immigrants in those areas. By contrast, San Francisco is diverse, 
with the largest group, Philippinos, making up 19% of the new immigrant 
population. New York’s new immigrants are even more heterogeneous, with 
no group accounting for more than 10% of the newcomers.” 

The distribution of the undocumented population differs significantly from 
the geography of the new immigrant population. Passel & Woodrow (1984) 
estimated that almost half of the undocumented population counted in 1980 
lived in California, with the next largest concentration, New York, containing 
only 11.4% of the total. Eight other states with the largest numbers of counted 
undocumented aliens accounted for roughly seven eighths of the total. SMSA 
estimates (Passel 1985) underline the distinctive spatial pattern of the un- 
documented, more than a quarter of whom live in Los Angeles where they 
make up half of the post-1960 immigrant population (See Table 1). 

The geography of the population taking advantage of the legalization 
program is more skewed still, though consistent with the national characteris- 
tics of. the legalizees, as noted above. As of December 1988, 55% of the 
pre-1982. applicants lived in California, with another 18% residing in Texas. 
By contrast, New York accounted for less than 7% of the applicants, well 
under its share of the presumably eligible population (US INS 1988). This 
pattern of geographic differences of over- and under-representation points to 
interesting and as yet unexplored disparities in the characteristics and modes 
of incorporation of the immigrant streams feeding into the major receiving 
areas (see Meisner & Papademetriou 1988:91—104). 


“Unfortunately, the Immigration and Naturalization Service has published only fragmentary 
data on the intended places of settlement of post-1980 immigrants; this information suggests that 
there has been little change in the importance of the largest immigrant-receiving areas. 

>Based on calculations from the 1980 Census of Population, Public Use Microdata Sample. 
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THE URBAN POSTINDUSTRIAL ECONOMY 


The paradox of the new immigration is that the recent wave of newcomers 
converged on cities just when the cities appeared to be undergoing their period 
of most severe decline. Although the old central cities of the nation’s heart- 
land had been declining slowly since the end of World War II, urban fortunes 
took a radical turn for the worse after 1969. Central city decline, a number of 
observers noted, was paralleled by and partly attributable to a vast redistribu- 
tion of people -and resources from the northeast to the south. The basic 
argument was first struck by Sale (1975), who contended that the new growth 
poles of the economy—agribusiness, defense industries, advanced technolo- 
gy, oil and natural gas production, real estate and construction, and tourism 
and leisure-~-had become concentrated in the states of the “southern rim,” 
bringing in tow a massive shift of people and power. Sale’s somewhat 
anecdotal account was.accompanied by a number of more academic treat- 
ments (Sternliecb & Hughes 1975, Perry & Watkins 1977, Sawers & Tabb 
1983) that substantiated the thesis of interregional economic shifts and 
documented the simultaneous ascent of sunbelt cities and decline of northern 
urban areas. 

But what first appeared to be a uniform pattern of interregional job and 
population shifts has, in retrospect, been recognized as a modification in the 
hierarchy of US cities. In this process, a small number of mainly older urban 
places have retained their commanding role in the nation’s economy, even 
while sunbelt cities have enjoyed considerable growth. . 

As Cohen noted early in the debate, the economic rise of the Sunbelt was 
not “accompanied by a decisive shift in the traditional locations of corporate 
control functions (1977:212).” These functions—foreign sales by multi- 
national corporations, research and development expenditures, international 
banking, loans to large corporations, and business services—were instead 
dominated by firms based in the older metropolitan areas at the top of the 
urban hierarchy. The Corporate Headquarters Complex in New York City 
(1977) identified the linkages among corporate headquarters, corporate ser- 
vice firms (banks, law, accounting, and security firms, advertising agencies 
and other business services), and ancillary business services as the source of 
new and enduring forces of urban agglomeration. 

` Binding these activities together is the shift to services and the concomitant 
globalization of the economy (Cohen 1981). Because internationalization 
involves an increase in the complexity and uncertainty of a firm’s business, 
large corporations with heavy foreign involvements have become more, not 
less dependent, on their external providers of business services. Services 
have in turn increasingly become concentrated in a cluster of firms that are . 
disproportionately based in the very largest diversified urban centers. Busi- 
ness services in the largest centers have also capitalized on their existing 
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expertise and resources and have been the quickest to take advantage of 
emerging global markets. Furthermore, the decentralization of routinized 
manufacturing and information processing activities has paradoxically in- 
creased the need for centralization of key control activities, producing a 
handful of “global information hubs” (Moss 1987). Finally, the efficiency and 
effectiveness of information flows combines with other imperatives—the 
need for frequent, and interorganizational, face-to-face contacts at high 
levels, the importance of specialized talent—to produce a “spatial bias” that 
stimulates growth in global cities (Pred 1977, Drennan 1987). 

Noyelle & Stanback’s (1984) classification of metropolitan areas according 
to the industrial characteristics of employment and specialization of business 
services, corporate headquarters, transportation, distribution, and com- 
munications, highlights four major trends: (a) the successful, far-reaching 
post-industrial transformation of the “national nodal centers” of New York, 
Chicago, San Francisco, and Los Angeles and their continuing dominance in 
corporate services, especially in international fields; (b) the emergence of new 
regional and subregional nodal centers associated with the new bases of 
economic growth in the sunbelt; (c) the stabilization of the older diversified 
regional nodal centers in the northeast; (d) and the continuing decline of the 
specialized production centers, most of which are located in the north. 

What is striking about the metropolitan patterns of the new immigration is 
how sharply these flows have converged on precisely those cities with 
economies that are both post-industrial and heavily oriented toward in- 

ternational business services. As Table 2 shows, command and control func- 
- tions remain concentrated in the four large cities that Noyelle & Stanback 
designate as national nodal centers. Among the top four, New York is clearly 
dominant on every dimension, despite the declining corporate headquarters 
presence. Miami occupies a lower-level but specialized position, with its 
concentration of foreign banks and its foreign travel hub pointing to its role as 
“the capital of Latin America” (Levine 1985). 


Urban Restructuring: Implications for Immigrants 


That so many immigrants should have arrived in America’s post-industrial 
cities is an anomaly from the standpoint of established urban analysis. The 
consensus holds that the advent of the post-industrial economy has un- 
dermined the historic role that cities have played as staging grounds for the 
integration of ‘unskilled, newcomer groups. This view contends that urban 
economies suffer a mismatch between the skills of their minority residents and 
the job requirements of employers in the white collar sector (Kasarda 1988). 

The mismatch hypothesis first emerged as part of the structural unemploy- 
ment controversy of the late 1950s and early 1960s when concern over a 
then-sluggish economy and fear of an impending technological revolution led 
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a number of analysts to identify skill inadequacies as the source of employ- 
ment dislocation. Whether the effects of the 1964. tax, cut disproved the 
structural unemployment thesis, as some Keynesians argued, or not, the low 
unemployment rate of the late 1960s eclipsed the controversy as well as the 
fears of technological displacement. Consistent with the political trends of the 
decade, the mismatch discussion became focussed on the particular problems 
of black workers, and in particular, black males. 

This revised mismatch explanation attributed low incomes and unemploy- 
ment to the out-of-sequence timing of black migration to the major urban 
centers of the north at a time of manufacturing decline. As Levy notes, after 
World War H black migrants were coming to northern cities “in search of 
higher incomes, and in these early post-war years the cities could accommo- 
date them. Cities had both cheap housing, and most important, manufacturing 
jobs” (Levy 1987:112). But what was true in the late 1950s rapidly changed. 
As manufacturing declined, the central cities—and in particular, the older 
urban centers——lost their absorptive capacities. Whereas manufacturing jobs 
had long permitted “immigrants access into the mainstream economy (albeit 
to the bottom rungs of the socioeconomic ladder)” [Kasarda 1983:22], the 
growth of employment in services—whether consisting of high-level jobs or 
low-skilled jobs in traditionally female occupations—had negative im- 
plications, especially for black males. One generation after these mass migra- 
tions, easy-entry positions have continued to dwindle while the population of 
young blacks has rapidly grown, further aggravating the imbalance between 
supply and demand (Wilson 1987). 

As an explanation of black poverty the mismatch E was ques- 
tioned a decade and a half ago on. grounds not dissimilar to the criticisms 
made today. The dualists argued that the central employment problem of 
blacks was not so much unemployment as a lack of well-paying, stable jobs; 
jobs in the low-wage sector were readily available (Doeringer & Piore 1971). 
Other. researchers compared black migrants from southern rural areas to their 
northern-born counterparts and found that the newcomers were not the worst 
affected by changing skill requirements (Lieberson & Wilkinson 1976, Long 
& Heltman 1975). 

These anomalous findings were never fully followed up, but they foresha- 
dowed the inadequacies of the mismatch hypothesis with respect to the new 
immigrant population. One crucial contention of the mismatch hypothesis has 
always been indisputable, namely, that the proportion of urban jobs requiring 
high-level skills and formal training has long been increasing. But this truth 
obscures the implications of the studies cited just above: the allocation of 
low-level jobs varies among social groups that, while roughly similar in 
skills, differ in origins and predispositions. In the current context, these 
disparities have become more important, precisely because of the immigrant 
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admixture to the central city population. However, the differences are for the 
most part glossed over: the mismatch hypothesis simply extends its story 
about blacks to encompass the more variegated minority population of today’s 
cities. 

The problem, of course, is that labor market outcomes differ dramatically 
by ethnicity and, within ethnic groups, by nativity. In Chicago, for example, 
the Mexican-born population increased by 83,000 between 1970 and 1980, 
despite Chicago’s loss of 115,000 manufacturing jobs over the decade. In 
fact, the newcomers had very low skills—far less schooling, on average, than 
native blacks, whose economic position deteriorated sharply during this 
period (Kasarda 1988). In New York, with its more complicated ethnic 
mosaic, the contrast between high immigrant employment gains for foreign- 
born blacks and job losses for native-born blacks makes it difficult to argue 
that a demand-side explanation can explain much of the change (Waldinger 
1986-1987). Finally, the mismatch hypothesis is at odds with the immigrant 
phenomenon itself: if indeed urban employers are hiring none but the highly 
educated, and this trend is pronounced in cities that have made the most 
far-reaching shift from goods to services, then why have these post-industrial 
centers emerged as the principal settlements of the new immigrant popula- 
tion? 

A more satisfactory answer to this query suggests that immigration is part 
and parcel of a fundamental process of urban “economic restructuring,” in 
which the growth of services breeds a demand for both high- and low-skilled 
labor while increasingly excluding workers with middle-level qualifications. 
Phrased this way, the story about the restructuring of urban economies is 
linked to broader controversies about changes in inequality in the United 
States (see Harrison & Bluestone 1988). The argument for restructuring has 
three principal components relevant to the discussion here. While the mis- 
match view sees manufacturing as a concentration of entry-level jobs avail- 
able for the unschooled, the restructuring thesis conceptualizes manufacturing 
as a locus of unionized, primary sector jobs attached to well-developed 
internal labor markets, which in turn are available to workers of low- or 
middle-level skills. Secondly, the replacements for manufacturing are mainly 
service industries with a bifurcated job structure, offering low wages and 
unstable employment for less-skilled. workers displaced from manufacturing, 
and therefore yielding highest levels of earnings inequality in cities with the 
greatest levels of service employment (Stanback & Noyelle 1982, Nelson & 
Lorence 1985). Thirdly, job arrangements in the service sector lack well- 


61970-1980 employment and population changes calculated from.Tables 81 and 87, US 
Department of Commerce (1973) and Tables 116 and 122, US Department of Commerce (1983). 
Data on employment-population and schooling rates for new Mexican immigrants and native 
blacks in-1980 calculated from 1980 Census of Population, 5% Public Use Microdata Sample. 
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developed internal labor markets (Noyelle 1987), with the result that low- 
skilled workers, whether new entrants to the labor market or displaced from 
manufacturing industries, have few opportunities for upward mobility (Harri- 
son 1982). 

The implication of restructuring for minorities is generally a muted sub- 
theme in the overall debate. Levy (1987) notes that, in contrast to the case of 
white males, the incomes of black males are increasingly divergent, with the 
split particularly noticeable between the 25 to 34 year olds with at least some 
college and those who never finished high school. Whereas Levy’s finding 
draws attention to polarization within minority populations, most advocates of 
the polarization view contend that the shift away from manufacturing pro- 
duces disproportionate displacement among minorities and thereby widens the 
disparity in minority and white employment rates (Wacquant & Wilson 
1988). 

But the emphasis on displacement obscures one of the crucial insights of 
the restructuring hypothesis, namely that because the shift to services actually 
generates jobs for people with low skills it may also have created the demand 
for workers willing to work at low-status, low-paying jobs. While such 
‘low-wage jobs are increasingly found in the advanced services, the simulta- 
neous proliferation of very high paid service workers further adds to the labor 
demand for immigrants, who attend to the household needs and elaborate 
consumption wants of these high income gentrifiers (Sassen 1988). 

The restructuring hypothesis further suggests that once put into place, the 
immigrant labor force can bolster the declining goods production sector and 
help revive sagging urban economies. Various analysts have argued that the 
absorption of immigrants into the manufacturing sectors of New York and Los 
Angeles has helped stabilize industrial employment for three reasons: (a) 
Immigrants have been a more pliant labor force, with the result that man- 
ufacturing wages, relative to the national average, have declined; (b) immi- 
grants provide a more flexible labor force, thus allowing for more customized 
production and greater subcontracting; and (c) the growth of the immigrant 
population adds to the aggregate local demand for cheap consumer goods, 
which can best be served by local producers (Davis 1987, Sassen 1988, Soja 
1989). 

An alternative to either the mismatch or the restructuring hypothesis is an 
argument emphasizing shifts on the supply-side (Waldinger 1986-1987, 
Bailey & Waldinger 1988). This explanation begins with a point developed by 
Lieberson in his book, A Piece of the Pie (1980), namely, if nonwhites are 
low in the hiring queue, their access to good jobs is greater where the size of 
the preferred, white group is smaller. Reformulated to account for changes 
over time, this proposition suggests that compositional changes in which the 
proportion of whites declines set in motion a vacancy chain, allowing non- 
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whites to move into, and up, the job hierarchy as replacements for whites. In a 
process identified by Piore (1979) the demand for replacement labor is further 
amplified by the self-limiting character of recruitment into the secondary 
sector, which has historically relied on migrants, white and black, to meet its 
labor needs. As the children of earlier migrants ‘enter the labor market, they 
drop out of the effective labor supply feeding into the secondary sector; 
hence, an additional set of vacancies emerge, beyond those generated by 
compositional changes alone. These two sources of supply-side changes 
offset the impact of structural change: Though urban economies are indeed 
shifting toward a job mix requiring higher skills, replacement needs ensure 
ample demand for workers with low-levels of schooling. 

To the extent that either a demand or a supply-side approach has significant 
explanatory power, it should succeed in accounting for the changes occurring 
in principal immigrant metropolises of New York and Los Angeles. As 
Muller & Espenshade show in their book The Fourth Wave (1985), the 
immigrants of the 1970s had little connection to California’s booming service 
economy. Though two thirds of the new jobs added in Los Angeles county 
were white-collar, and manufacturing barely expanded, half of the decade’s 
‘Mexican immigrants worked in the factory ‘sector, with the next largest 
contingent in low-level services. Not only did the new immigrants garner all 
of the manufacturing jobs created during the 1970s, they replaced another 
55,000 workers who found jobs in other industries. Outside the manufactur- 
ing sector, “the most dramatic changes took place in personal services, where 
all immigrants combined took 24,400 jobs when the sector lost 7,000 jobs” 
(Muller & Espenshade 1985:59). 

The arrival of the new immigrants, Muller & Espenshade argue, had two 
effects on the economy of Los Angeles county. The newcomers directly 
spurred economic growth, both as a source of low-wage labor, and as 
consumers of services mainly produced and provided locally. More im- 
portantly, they provided replacements for Angelenos who migrated out of the 
county at a pace that exceeded the out-migration rate-for all of the country’s 
metropolitan areas, except New York, Jersey City, Buffalo, and Cleveland. 
‘The bulk of these out-movers were blue-collar workers; had these internal 
migrants and their white-collar counterparts not been replaced by immigrants, 
Los Angeles county would have lost, not added, half a million people.’ 

In New York, the pattern of immigrant incorporation during the 1970s 
reads much the same, notwithstanding a period of severe economic contrac- 
tion and erosion in the manufacturing sector (Sternlieb & Hughes 1987). At 


7A comparable treatment can be found in McCarthy & Burciaga Valdez (1986), who concur 
with Muller & Espenshade on the impact of Mexican immigration on low-wage job growth but 
conclude that low incomes diluted the immigrants’ purchasing power and thereby their effect on 
consumer markets. 
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the beginning of the 1970s, immigrants seemed poised for disaster, over- 
concentrated in precisely those industries that were to decline the most 
severely. Yet the biggest job losers of the seventies were whites, who lost 
almost twice as many jobs as the city overall. By contrast, employment 
among New York black, Hispanic, and Asian immigrants increased by 215%, 
64%, and 249%, respectively®. Despite liabilities of language, skill, and 
newness, rates of labor force activity for new immigrants compared favorably 
with the pattern for the white native-born (Bogen 1987, Papademetriou & 
Muller 1987). 

Had there beén no immigration, New York would have suffered an even 

more severe decline. As in Los Angeles, the arrival of the new immigrants 
provided a direct stimulus to the local economy. With a growing immigrant 
labor force, wage pressure eased up, making local manufacturers more com- 
petitive (Waldinger 1986); relative wages in goods production also declined, 
though without adverse impacts on the availability of labor (Sassen. 1988). 
Marshall further argues that “the abundant supply of immigrants . . . actually 
stimulated a demand for labor in immigrant industries” (1987:95); While 
much of her evidence bears on manufacturing (Marshall 1984), it also seems 
likely that immigrants had a positive impact on the local economy sector. 
But the principal function of immigrants was as replacements for native- 
born, mainly white, workers in the declining sectors. Manufacturing and 
retailing together contributed more than half of the decline in white employ- 
ment. By contrast, immigrants made their most sizeable gains in precisely 
these sectors, where they had already been concentrated in 1970 (Waldinger 
1986-1987). 
. This comparison suggests that a more interactive framework may be in 
order, with an emphasis on the mechanisms whereby migration becomes a 
self-feeding process. Thus, one might argue that the initial influx of immi- 
grants is closely related to demands for replacement labor, arising as a result 
of ethnic succession in traditional immigrant industries; this phase is likely to 
be extended, giving rise to a sustained need for replacement workers. Once in 
place, however, the new immigrant labor force has an indepéndent impact on 
the local economy. Wage pressure is reduced, thus easing. competitive pres- 
sures on manufacturing and other low-wage sectors (such as personal ser- 
vices); the immigrant population increases the demand for products and 
services from the local economy sectors; and as immigrants carve out occupa- 
tional or industrial niches, they develop channels of labor market information 
and support, as well as business enclaves that both reduce the cost ‘of 
migration for newcomers and directly open up recruitment channels. 


®Calculated from Waldinger, 1986—1987:376. 
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THE ECOLOGY OF THE NEW IMMIGRATION 


The geography of the major immigrant receiving cities has been transformed 
by the expansion of the corporate headquarters complex. What were stagnant, 
or in Los Angeles’ case, vestigial, downtowns have been transformed by an 
office construction boom that has often exceeded the underlying demand 
(Feagin 1983, Light 1988, Soja 1989, Squires et al 1987, Sternlieb & Hughes 
1987). To serve the booming office complex has come a new population of 
managers and professionals who comprise the huddled masses of the late 
twentieth century, piled up in neighborhoods in and around the central 
business district (CBD) (Zukin 1987). 

But it is not clear just how immigrant residential patterns are affected by the 
reinvigoration of downtown, and the related housing market impacts. Re- 
search on immigrant settlement patterns remains heavily influenced by eco- 
logical theory, which argues that structural forces, as well as the characteris- 
tics of the migration streams themselves, will continue to generate distinctive 
ethnic neighborhoods (Micklin & Choldin 1984). Newcomers can be ex- 
pected to settle in old, inner-city neighborhoods, close to their CBD jobs. The 
pace of neighborhood succession from native to immigrant will be influenced 
by the overall state of an area’s economy: rapid where economies are expand- 
ing, slow where economies are stagnant. And dispersion will take place with a 
rise in socioeconomic status, denoting a trend toward assimilation (Massey 
1985). © a sO 

The settlement patterns in the major immigrant-receiving areas are highly 
diverse, and the differences are not clearly consistent with the expectations of 
ecological theory. On the one hand, the emergence and burgeoning of 
Koreatowns, Little Havanas and Haitis, and Odessas by the Sea seem to 
replicate the concentrated, centralized pattern established by turn-of-the- 
century immigrants. Perhaps the best example is the rebirth of America’s once 
- ageing Chinatowns, of which San Francisco’s is a typical example. Though 
the community seemed to be facing imminent physical disintegration in the 
mid-1960s (Nee & Nee 1973), the decline has since been turned around. By 
1980, population gains produced high Chinese density in the core areas, led to 
overflows beyond the traditional boundaries, and attracted a massive influx of 
Asian capital, which in turn spurred further growth (Godfrey 1988 ). The 
pattern of Mexican settlement in Chicago is analogous, with the heavy 
inflows of the 1970s yielding a shift from a more decentralized pattern of the 
1960s (Ropka 1973) to one of concentration in old industrial districts close to 
the city’s center (Belenchia 1977). These areas, which had long served as 
white immigrant and ethnic enclaves, saw rapid white-to-Mexican transitions 
during the 1970s (Chicago Fact Book Consortium 1984). 

Though notable, the patterns of Chinese or Mexican clusters in established, 
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inner city ethnic enclaves are anomalous in two respects. First, the centraliz- 
ing imperative of the manufacturing industries of the Central Business District 
exercises less force than before. Though clustering next to the CBD persists, 
immigrant neighborhood formation in the older cities also takes a form that 
Ivan Light (1983) has identified as “leapfrog migration.” Ironically, the 
decentralized pattern is most in evidence in New York City, where only 
Dominicans and Chinese have established sizeable concentrations in Man- 
hattan; the bulk of the immigrants have settled in the “outer boroughs” (Bogen 
1987, Waldinger 1987). Even in these locations, the tendency has been for 
newcomers not to settle in the traditional, immigrant ‘neighborhoods, but 
rather in areas that until recently supported a white, middle-class population. 

The ecological model also bears a limited relationship to the actual process 
by which the immigrant-receiving cities of the sunbelt developed. In Los 
Angeles, for example, where the build-up of downtown LA pushed out the 
old Mexican barrio from the CBD before the 1920s, Mexicans jumped over 
other close-in areas to settle in emerging streetcar suburbs close to the city 
borders (Romo 1983). Over the years, as population grew and the rapid 
deconcentration of Los Angeles industry (Soja et al 1983) encouraged resi- 
dential dispersion, the Mexican community spilled over the city boundaries. 
The end result was a vast, 15-mile wide community, which though known as 
East Los Angeles, lies mainly in unincorporated territory within Los Angeles 
county (Nelson 1983). By 1980, Hispanic majorities had emerged in numer- 
ous suburban municipalities that now found themselves part of the burgeoning 
East LA barrio (Garcia 1985). However, the attraction.of low-cost housing 
being vacated by whites and the importance of proximity to industrial jobs 
have constrained the dispersion of Mexican settlements (Baer 1986, Davis 
1987). Dagodag (1985) presents.evidence suggesting that new immigrants 
were particularly likely to settle within a ten mile belt, south of the CBD, that 
lies mainly within city boundaries. Growing convergence on more central 
locations has also led to incursions. into the black ghetto; the Mexican 
population living in Watts more than doubled between 1970 and 1980 (Oliver 
& Johnson 1984). 

Settlement patterns in Miami reveal still another variant to the standard 
ecological model. According to Boswell & Curtis (1984), a core of Cuban 
settlers preceded the large wave of refugees who arrived following the Cuban 
revolution; the refugees then converged on that initial area of settlement. In 
the 1950s, this area was economically depressed and had been losing pop- 
ulation; hence both cheap housing and low-rent storefront retail spaces were 
available. Proximity to jobs in the larger economy does not seem to have been 
an important attraction, since manufacturing had already shifted out of the 
core (Mohl 1983). Thus, a pattern of over-concentration in the Miami central 
city rapidly developed, with the clustering of Cubans of all classes in Little 
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Havana (Cooney & Contreras 1978) providing the basis for the enclave 
economy that.developed.during the 1970s (Portes & Bach.1985, Portes 1987). 

But while’Miami’s Cuban community is over-centralized, blacks are main- 
ly dispersed in outlying areas, with the largest concentration residing in the 
unincorporated area called’ Liberty City. The origins of this ghetto go back to 
the:inability of blacks to find housing in Miami’s segregated housing market 
long before the Cuban influx, -and.to local government.and ‘business leaders’ 
desire to-remove blacks from proximity to the CBD (Mohl 1983). Thus, the 
poorest, -most alienated population resides not in the. “inner city” but in an 
outlying area. And this presence is in turn felt in the evolving geographic 
patterns of the Cuban population, which has skirted black concentrations as it 
has suburbanized (Boswell & Curtis 1984). 

A major focus of research has been on patterns of residential segregation. 
The influx of new Hispanic immigrants during the 1970s appears to have had 
two distinct effects in the five major areas with which we are concerned. First, 
the pattern of residential transitions shifted. In the 1960s, few Hispanic 
enclaves arose, with neighborhoods containing Hispanics also gaining Anglo 
residents (Massey & Mullan 1984). In the 1970s, by contrast, those same 
neighborhoods lost their Anglo residents, and Hispanics converged on in- 
creasingly homogeneous clusters (Bean et al 1987). Second, the segregation 
of Hispanics (foreign and native-born) increased on all of the dimensions 
along which it is measured. Though the levels reached in 1980 remained well 
below the highly segregated pattern characteristic of blacks (Massey & Den- 
ton 1987), Hispanic segregation rose more sharply in the five top SMSAs than 
in other areas with sizeable Hispanic concentrations (Bean et al 1987). The 
increase was lowest in New York, which already had the most segregated 
pattern, and where there was little net increase in the Hispanic population— 
the principal factor behind the growing separation and isolation of Hispanics 
in the other principal SMSAs (Massey & Denton 1987). 


Changes in the Urban Core: Toward a Reconceptualization 


Research on residential segregation has become increasingly sophisticated 
over the past ten years, with the development of new (and refinement of old) 
indexes (Massey & ‘Denton 1988), techniques for distinguishing between 
voluntary and involuntary segregation (Lieberson & Carter 1982), and the use 
of structural equation models to predict-residential succession patterns (Mas- 
sey & Mullan 1984). Nonetheless, the underlying theory is both incomplete 
and abstracted from the cultural, institutional, political, and market factors 
that affect the allocation and use of space among groups. The methodology 
suffers from similar deficiencies, yielding ‘findings that are highly aggregated 
and not easily related to patterns on the ground. 

These shortcomings reveal a broader lacuna in American sociology, name- 
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ly the failure to develop a sociology of the housing and real estate markets 
(see Gilderbloom & Applebaum 1988). Thus, rather than a holistic approach 
to the study of urban spatial processes, there is a curious division of labor, 
with analysts focussing on their “own” piece of the urban land market, more 
or less insensitive to developments falling outside their immediate domain: 
Comparison of research on the settlement patterns of immigrants with work 
on urban revitalization? underlines these points. One would not know, from 
reading either of these literatures, that urban revitalization and immigration 
are ongoing and simultaneous phenomena in these five principal settlements 
of the new immigrant population. The only book that treats immigrant .and 
gentrification neighborhood transitions as related, and within a common 
analytic framework, is a 1988 publication by 4 geographer (Godfrey 1988)! 
The methodologies of the two fields stand in equally sharp contrast. Research 
on urban revitalization tends to focus on a single city or cluster of cities; 
studies of institutional actors (banks, political entrepreneurs, community 
organizations) are common approaches, as are surveys of the residential 
preferences and life-styles of individual gentrifiers; and considerable attention 
is paid to the impact of oem revitalization on housing and job opportunities 
for other city dwellers. !° 

No complement to these approaches seems to exist in the study of immi- 
grant settlement patterns and neighborhood transitions. Yet the processes by 
which space is allocated and used by immigrants are undoubtedly affected by 
sociocultural characteristics that cannot simply be inferred from aggregate 
data. For example, differences.in the distribution of social and political 
resources among ethnic groups affect both the spatial distribution of popula- 
tion and their access to housing. Thus, the very high levels of black-Hispanic 
segregation in Chicago undoubtedly bear some relationship to the fact that 
Hispanics are virtually excluded from. the public housing sector, which is 
predominantly black (Orfield & Todaro 1983). Similarly, the organizing 
capacity of particular ethnic groups has frequently been invoked to explain 
high rates of self-employment (Light 1972), and such disparities, though not 
readily measurable by standard socioeconomic indices, may also affect eco- 
logical outcomes. New York’s Caribbean blacks have made their principal 


°T am using this deliberately neutral term to encompass both gentrification and accelerated 
CBD commercial development; another term might be the “Manhattanization” of downtown 
(Feagin 1983). It does appear that gentrification and CBD development are closely related in the 
cities with which we are concerned, though the ecological and architectural conditions of 
gentrification are absent in Miami and relatively weak in Los Angeles. 

'SAs Cornelius et al note, “Most studies of Mexican immigration have ignored housing 
patterns; and general studies of housing in the (southern California) region have not focused on 
the independent impact of immigration” (1982: 52). The same point holds for the other areas with 
which we are concerned. 
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settlements in areas of small, multi-family dwellings, where they have 
achieved high rates of homeownership (Foner 1987), a characteristic that not 
only distinguishes them from native blacks, but also provides a structural 
basis for an emergent and distinctive Caribbean ethnicity (Kasinitz 1987). As 
Godfrey (1988: 104) noted in his study of San Francisco, distinctive group 
predispositions also exercise a significant effect on the types of areas that 
attract immigrant populations. 

Whatever the methodological shortcomings of research on the ecology of 
these large immigrant-receiving cities, the fact that we know little about how 
urban revitalization and immigration are related to one another casts doubt on 
our understanding of both phenomena. For example, Berry has argued that 
gentrification 1s a phenomenon of older cities, in which tightness on the 
supply-side (caused by “contagious abandonment” of the older stock, far in 
excess of normal withdrawals), creates the necessary but not sufficient con- 
ditions for gentrification. Rather, “the process has to be activated by demand- 
side shifts rooted in professional job growth anchored in downtown offices 
... (Berry 1985:92). But it seems difficult to believe that Sohos were the 
only areas where the “urban revitalizers” could find a place to live when 
housing markets in gentrifying cities were sufficiently loose to allow for the 
influx of hundreds of thousands of immigrants—many of whom settled in 
areas of fairly high housing quality. 

A more useful analytical framework is the one developed by Taub et al 
(1984) who contend, on the contrary, that in the 1970s slackness on the 
supply-side created the preconditions for gentrification. Because the pent-up 
demand for housing among blacks eased off, “congestion and un- 
dermaintenance . . . became less profitable to those involved in urban real 
estate” and “the decline of pressure on the housing market . . . also took some 
of the profit out of encouraging racial change” (p. 7). The same set of 
circumstances yielded a permissive environment for gentrification: There was 
an ample stock of low-priced, attractive housing, and the availability of 
housing alternatives for blacks eased the pressures for black “invasion”. 
While Taub et al are not concerned with settlement patterns of immigrants, 
their argument can be extrapolated to account for why housing vacancies 
emerged for gentrifiers and immigrants simultaneously. 

Of course, circumstances in the urban housing market of the 1980s are 
altogether different. Rather than slackness, the problem is one of inadequate 
supply; the most extreme manifestation of these new conditions is the growth 
in the number of homeless. How the current housing crisis affects the 
settlement pattern of the most recent immigrants is as yet unexplored. One 
possibility may be a shift away from those locations with lowest vacancy rates 
and highest home or rental prices: That immigration to New York City should 
have risen during the 1980s suggests this has not happened. Research on 
gentrification has been particularly concerned with the potential for dis- 
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placement; the conversion of single room occupancy hotels to luxury con- 
dominiums appears to offer a clear example of the relationship between 
“urban revitalization” and homelessness. Yet it is equally appropriate to 
inquire into the impact of immigrants. As Zukin (1987) has argued, 
“gentrification represents a filtering up of housing” (p. 141). The same might 
be said for immigrants, who not only add quantitatively to the demand for 
housing, but who are likely to be able to draw on greater resources in the 
search for housing than many native, minority residents. 


CONCLUSION 


The convergence of the post-1965 immigrants on America’s largest urban 
areas was as unexpected as the composition of the new immigrants and their 
growing numbers. Just why cities provide a supportive environment for the 
newcomers at a time when urban economies have undergone great trans- 
formations remains an unanswered question. In this essay, we have tried to 
develop a framework that emphasizes the interaction between supply-side 
developments that create vacancies for immigrants and those demand-side 
consequences of their arrival that make immigration a self-feeding process. 
More puzzling still is the linkage between the changing internal geography of 
the immigrant-receiving areas and the settlement patterns of the newcomers, a 
question that researchers have generally not addressed. As with labor market 
developments, compositional changes are likely to have had an important 
effect. A supply-side account might do best in explaining why housing 
vacancies opened for immigrants and gentrifiers simultaneously during the 
slack housing markets of the 1970s. 
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INTRODUCTION 


The study of adolescence has long been dominated by psychologists, begin- 
ning with G. Stanley Hall (1904). In the 1980s a great increase in research 
activity has occurred, culminating in the founding of the Society for Research 
in Adolescence. Along with that increase in volume has come a major shift in 
the focus of adolescent research. Whereas most earlier research was limited to 
the study of individual adolescents carrying out their developmental tasks 
(Erikson 1968), an increasing proportion of research now places the biologi- 
cal, cognitive, and emotional development of adolescents in a broader social 
context. i 

The changing orientation of adolescent research is in part a function of the 
breakdown of traditional barriers between fields. Outstanding developmental 
psychologists now perceive adolescence as occurring in historicál, social, 
organizational, cultural, and institutional contexts (Bronfenbrenner 1979, 
Lerner & Foch 1987). Biological determinism has been discredited, and the 
study of pubertal development is now concerned with the evaluation of 
physical development by social actors (Brooks-Gunn 1984, Dornbusch et al 
1987a). 

Some of the changes in orientation within the field of adolescent research 
are a product of contributions by sociologists. Sociologists who have helped 
to reshape the research agenda include Zena Blau (1981), Orville G. Brim, Jr. 
(Brim & Kagan 1980), Albert K. Cohen (1955), James S. Coleman (1961, 
1987), Glen H. Elder (1974, Elder et al 1985), Joyce Epstein (1981), Howard 
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Kaplan (1980), Joan McCord (1978, 1983), Morris Rosenberg (1965, 1979), 
and Roberta G. Simmons and Dale A. Blyth (1987). Their studies are an 
integral part of the knowledge base in the field, and their ideas have per- 
meated the work of researchers drawn from other disciplines. The result is a 
swiftly developing field that appreciates the sociological perspective, one in 
which researchers from other disciplines often think like sociologists. 

Accordingly, this chapter is not limited to a recitation of work by-persons 
trained to be members of the sociological union. Nor can the chapter encom- 
pass the full diversity of recent contributions to a ripening field. Its goal. is 
simply to present some current perspectives on adolescence that should 
interest sociologists, although little known among them. The professional 
affiliation of each researcher has not been a major criterion in selection. 

This brief summary of a few topics cannot claim inclusiveness. Omitted 
almost completely are those topical areas that have. been well covered in 
previous reviews of research, such as education, status attainment, and the 
self-concept. The content of the adolescent culture, in‘terms of its fads and 
fashions, is not discussed (Brown, in process). Political and social move- 
ments among adolescents are not covered (Lipset & Altbach 1969). Finally, 
sociological studies of delinquency, a major societal problem, are-discussed 
only insofar as the research relates to central themes of this review. Most 
sociologists are well acquainted with the leading theories (McCord & McCord 
1959, Cloward & Ohlin 1960, Matza 1964, Short & Strodtbeck 1965, 
Burgess & Akers, 1966, Hirschi 1969, Sutherland &-Cressey 1978, Elliott et 
al, 1979, Gove 1980, Hirschi & Gottfredson, 1980) and research findings in 
the study of delinquency. This chapter’s modest goal is to inform sociologists 
of some exciting findings and controversies that are not part o the typical 
sociologist’s body of aa 


THEORETICAL DEVELOPMENTS | 


One of the most influential works in the field of adolescence addressed itself 
to a somewhat distant topic. Brim & Kagan (1980) argued, with considerable 
empirical evidence, that the impact of early childhood experiences was 
neither as pervasive nor as long-lasting as was commonly believed. The 
correlations between the characteristics of young children and those same 
children in the later years of childhood and adolescence. were surprisingly 
weak. An indirect result of this seminal work was an increased emphasis on 
the adolescent period. That period,. and the context in which adolescent 
development took place, was seen as worthy of study in its own right. No 
longer was adolescence viewed predominantly as an arena in which preexist- 
ing patterns were acted out. Adolescent development could be studied autono- 
‘mously, even though its links to childhood and adulthood were obvious. 
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Urie Bronfenbrenner (1979, 1986), a developmental psychologist, helped 
create an atmosphere in which the contributions of sociologists and an- 
thropologists were seen as vital to understanding the process of human 
development. His conceptual model alerted psychologists to the importance 
of context. He divided the ecological context of development into three parts. 
The microsystem referred to the pattern of activities, roles, and interpersonal 
relationships in a face-to-face setting. The mesosystem linked two or more 
settings in which the developing person was a participant, such as home and 
school. The exosystem linked two or more settings, but the developing person 
was not present in at least one of the settings. For example, the relation 
between the home environment and the work of the parent outside of the home 
is part of the exosystem for the child. 

The impact of Bronfenbrenner’s work was profound. Developmental psy- 
chologists found themselves, for example, examining the patterns of rela- 
tionships in particular family structures, the interaction of daycare systems 
and the quality of the home environment, and the generalizability of their 
findings to diverse ethnic and social class groups. Difficulties in communica- 
tion between the disciplines were reduced as sociologists were viewed as 
studying central aspects of adolescent development. 

But Bronfenbrenner also criticized the emphasis on status differences in the 
study of development. In particular, he inveighed against status addresses, 
which he viewed as little more than a label attached to an environment 
(Bronfenbrenner & Crouter 1983). The use of such addresses as social class, 
family size, ordinal position, area of residence, ethnic group, home or 
daycare, parental employment, or type of schooling is widespread, but only 
appropriate for an early stage of investigation. Within any specific social 
address, we are likely to find marked variation in values and in behavior 
(Yankelovich & Clark 1974). Social addresses are useful in delineating 
important aspects of the environment, and the researcher should go on to 
specify aspects of behavior in context that will produce specific de- 
velopmental outcomes. In urging that researchers take person-process-context 
into account, Bronfenbrenner (1986) is calling for a merger of structural and 
social psychological approaches that this author considers desirable in numer- 
ous arenas of social research. 

At the same time the imagery of pubertal development has been changing. 
Biological processes are no longer perceived as having a direct impact on 
human experience; instead, complex interactions between pubertal develop- 
ment, societal expectations, and perceptions -by significant others are ex- 
amined by researchers (Simmons & Blyth 1987). Pubertal change continues 
to be viewed as one of the key elements in the process of adolescent 
development. The emphasis on the social self (Mead 1934, Rosenberg 1979, 
Harter 1983) and on societal expectations (Brooks-Gunn & Petersen 1983) in 
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the study of puberty has made social factors all the more important in studies 
of adolescence. 

The pioneering work of Glen Elder (1974, 1980) has had a major impact on 
the study of adolescence. He examined the later life of children and adoles- 
cents born during the Great Depression of the 1930s. Cohorts differing by 
only a few years in their birthdates experienced different historical conditions. 
The effects of early hardship were very different for youngsters of differing 
ages, and their later experiences and outcomes also were not similar. Such 
longitudinal studies were influential in shaping the design of developmental 
analyses. Increased attention is now paid to historical trends in the larger 
society and to the incorporation of successive cohorts in longitudinal studies. 
This life-course perspective (Baltes et al 1980, Elder et al 1986) is now widely 
accepted among those studying human development. 

We must note the impact on later research of two images of the adolescent 
period. The first is that of adolescence as an oppositional culture, an image 
found in Coleman’s (1961) portrayal. His imagery emphasized the separation 
of adolescent peer groups and their culture from the world of adults, as well as 
the underlying struggle of peers with parents for control of adolescents. 
Research findings have forced modification of some of his perspectives; there 
is now a greater current emphasis on the variety of peer cultures and a view of 
peer groups as often reinforcing the values of the adult socializers. Yet, even 
as his ideas are attacked, Coleman’s approach continues to affect contempo- 
rary research on adolescence. 

Ogbu (Ogbu 1974, 1977, Fordham & Ogbu 1986), an anthropologist, has 
emphasized the power of peer culture in his analyses of images of success 
among inner-city blacks. He also emphasizes an oppositional culture, but one 
which focuses on opposition to the white middle-class emphasis on schooling 
and occupational success. Although blacks emphasize many of the same goals 
as whites, the history of discrimination against blacks leads them to believe 
that conventional strategies for success will not work for them; thus they 
admire those blacks who succeed without imitating white patterns of behavior 
in school and community. This oppositional culture is expressed by peer 
groups negatively defining those blacks who work hard and succeed in 
school. Ogbu’s perspective is currently influential among students of ethnic. 
differences in aspirations and achievement, but few studies have directly 
tested its central tenets. 

Finally, the general picture of adolescence as a period of storm and stress 
(Freud 1958) has not been supported by the weight of recent research. Clinical 
populations, selected because they exhibit problems of adjustment, provide a 
biased sample of the general adolescent population. Psychoanalytic models of 
adolescence, based on clinical studies, view adolescent rebellion against 
parental control as both normative and desirable. The struggle of the adoles- 
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cent for autonomy in these models makes parent-adolescent conflict inevita- 
ble. 

But the empirical evidence does not support this perspective. The increased 
bickering between parents and adolescent should not be viewed as an in- 
dicator of adolescent detachment from family ties (Steinberg & Silverberg 
1986, Silverberg & Steinberg 1987, Hill 1988). Even as adolescents strive for 
increased autonomy, close ties with parents can and do persist. About three 
fourths of all adolescents feel close to and identify with their parents, and 
many of the remaining group evidenced problems in family relationships long 
before the onset of adolescence. Only a relatively small proportion of all 
adolescents, perhaps less than 10%, experience a deterioration in the quality 
of their family relationships (Steinberg in process). 

Among adolescents, harmony in family life is more frequent than is conflict 
(Kandel & Lesser 1972; Offer and Offer 1975, Conger 1981). Indeed, 
adolescents who are closer to their parents psychically are higher on various 
measures of psychosocial adjustment (Jessor & Jessor 1977, Maccoby & 
Martin 1983; Steinberg & Silverberg 1986). Thus, earlier models of familial 
stress and adolescent detachment appear appropriate only for a small minority 
of adolescents. Yet those models continue to dominate the images of many 
researchers, parents, and practitioners. 


PUBERTAL DEVELOPMENT 


The focus of research on pubertal processes during adolescence is no longer 
purely biological, with assumptions that biological and age-related social 
changes are similar for most adolescents (Brooks-Gunn & Petersen 1983). 
The meaning of puberty, and even its timing, is seen as socially defined, with 
different effects in different contexts. In nonindustrial societies, puberty may 
mark the entrance to adult status. In developed societies, where education is 
long and almost universal, puberty may be the transition point to adolescence. 
But in all societies puberty is a cultural event that is viewed as a transition. 

For more than a hundred years, in diverse countries, the age of onset of 
puberty has declined (Tanner 1962). The trend, which is now slowing, is 
believed to be the product of improvements in nutrition. This earlier age of 
puberty has been associated with societal changes in expectations, such as in 
the age of first dating among Western youth. But individual differences in 
pubertal development are not a major determinant of individual differences in 
dating. Age, far more than sexual maturation, explains the variation in dating 
behavior (Dornbusch et al 1981). Similarly, black females, who tend to 
achieve sexual maturity earlier than whites, are slower to enter into dating 
relationships. Concerns among black parents about the results of early dating 
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may explain that differential between biological development and social 
behavior (Dornbusch et al 1984b). 

Selective patterns of association interact with biological development to 
produce specific behaviors. Early-maturing Swedish girls were more likely to 
drink and to be sexually active than were later maturers; but that pattern of 
behavior was found only among those early-maturing girls who had older 
‘friends. Early maturers did exhibit a tendency to have older friends, a product- 
of their choice or of their being chosen (Magnusson et al 1985, Magnusson 
1988). Physical development and social development exhibit a complex 
interaction. 

Some of the physical changes that occur during puberty have social mean- 
ings that are largely unexplored. Among children the same age, for example, 
taller children are expected by adults to evidence greater social maturity 
(Brackbill & Nevill 1981). Among adolescent boys, height is positively 
associated with educational expectations among parents and the adolescents 
themselves (Duke et al 1982). These expectations, related to physical changes 
occurring during puberty, may shape social interactions and have long-term 
effects. We need longitudinal studies of the impact of such social assessments 
during the adolescent transition upon later adult characteristics and behaviors. 

The prevalence of eating disorders among adolescent females of higher 
status is probably a product of the interaction of pubertal development and 
social standards. Sexual maturation brings body fat to young girls, and this 
does not conform to cultural values that promote thinness. The result is a 
process, positively associated with social class, that finds young women 
denigrating the results of normal sexual development. By the time they are 
sexually mature, one half to three quarters of all women want to be thinner 
than they are (Dornbusch et al 1984a). | 

The relation of this negative perception of female physical development to 
social class is puzzling. Veblen (1899) portrayed the emphasis on female 
thinness as part of an upper-status view of women as decorative appendages to 
wealthy men, a form of conspicuous consumption. Today the emphasis on 
female thinness is stronger in the higher social classes, but present even in 
lower-status groups in industrial societies. The emphasis on thinness appears 
to have spread throughout the stratification system. But what is the explana- 
tion of that spread and its persistence through time? The greater participation 
of women in the labor force would seem to contradict a central assumption of 
Veblen’s analysis, that women make no direct economic contribution. Yet 
this set of perceptions of the female body shows no sign of decay. Since the 
standards for physical attractiveness are social standards, sociologists are 
faced with an intriguing challenge. 

Brooks-Gunn (personal communication) has ‘studied an interesting example 
of the impact of a social context on the perception of pubertal development. 
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The ideal body of girls-in dance companies is thin. Therefore, female dancers 
are likely to be late maturers, for pubertal development adds fat to female 
bodies. Adolescent dancers who mature early. or on time have more eating 
disorders and problems of social adjustment than do dancers who mature late. 
We see here a clear example of contextual effects‘on the social perception of 
individual pubertal development. 

Finally, Simmons & Blyth (1987) have found that the context of the 
elementary school or the junior high school can have specific and lasting- 
effects on psychosocial adjustment during puberty. The transition “to the 
impersonal world of the junior high school may be difficult for pubertal 
children (Blyth et al 1978, 1983, Simmons & Blyth 1987). Multiple biologi- 
cal and social events are associated with an increase in problems of adjust- 
ment (Simmons et al 1987). Especially for girls, thrusting youngsters into a 
more difficult environment as they begin adolescence is associated with lower 
self-esteem, poorer school performance, and lower levels of extracurricular 
participation. Since these effects persist into the later grades of high school, 
we have evidence here of the importance of taking into account the stage of 
development of adolescents before designing institutions in which they must 
participate. This study is a model for research that blends the study of social 
institutions with the analysis of adolescent biological development. 


FAMILIES IN CONTEXTS 


Most studies of family processes focus on infants and young children, for 
there is a prevalent assumption that parental influence wanes as children age. 
There is little evidence on which to base that belief. Walberg & Marjoribanks 
(1976) reanalyzed data from a four-year longitudinal study of reading 
achievement among English women. The impact of parental encouragement 
and stimulation was as great for adolescents as it was for young children. 
Parents continue to influence their children during adolescence, and there is a 
need for more research on the parenting process during that period (Marjori- 
banks 1987). 

With respect to studies of family processes in adolescence, there is a need 
for greater integration across disciplinary lines. Psychologists have empha- 
sized the cognitive and attitudinal attributes of individuals. Sociologists have 
concentrated on status addresses, especially indicators of social class and of 
status attainment. Social psychologists have looked at some parenting pro- 
cesses, but seldom have they examined the interaction of variables in diverse 
contexts. Because of the lack of integration across disciplinary perspectives, 
our general theoretical understanding is weak. 

We have learned that social status indicators are relatively poor predictors 
of children’s educational achievement, compared with family process mea- 
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sures, and that family processes are related to children’s achievement to an 
important extent across all status groups (Dornbusch & Wood 1988). That 
gives us confidence in the importance of parenting behavior in adolescence, 
and in the utility of breaking down disciplinary boundaries as we seek to 
develop our understanding of parent-adolescent interactions. 

One aspect of the total familial environment is too seldom studied; namely, 
its stability or inconsistency. Pulkkinen, a Finnish psychologist, studied 
environmental stability and change in a longitudinal study of children, be- 
ginning at eight years of age and extending through their adolescence. She 
measured the frequency of events like divorce and remarriage, changes in 
maternal employment, moving of the family, and changes in educational 
arrangements. The greater the number of such changes and the more that the 
child or adolescent was exposed to simultaneous changes, the greater was the 
likelihood of personal and social problems in adolescence. Indeed, the level 
of family stability had a greater impact on development than did the social 
class of the family (Pulkkinen 1982, also see Simmons et al 1987). 

The level of stability of the family environment may be correlated with 
inconsistency of parenting, although there are no direct studies of the relation 
between these two sets of measures. Dornbusch et al (1987b) found that 
adolescents from families in which the parents exhibited mixed or inconsistent 
child rearing styles had lower grades than did adolescents whose parents 
emphasized a single parenting style. There was also a slight tendency for 
inconsistent communications from parents to be associated with a lower 
correlation between hours of homework and grades. Perhaps inconsistency in 
the home environment creates anxiety, and that anxiety reduces the relation 
between the student’s effort in school and the grade received. 

Steinberg et al (1988) have an interesting interpretation of these findings 
about inconsistency. They note that parental consistency may play a special 
role in promoting school success during adolescence. It appears to them likely 
that consistency is a component of parental demandingness, the creation of a 
stable set of norms and expectations to which the adolescent is expected to 
respond. 

Just as family processes may be linked to the stability of the familial 
environment, so may there be interactions of family variables and community 
variables. Coleman (1987), who emphasizes community weakness as a basis 
for poor schooling, sees strong communities as based on relatively strong 
families. Direct studies of family embeddedness and community strength 
seem necessary and are in process at Chicago, Stanford, and Wisconsin, but 
our knowledge of community influences during adolescence is surprisingly 
weak. Much of the contemporary discussion on communities is primarily 
based on anecdotal evidence (Steinberg 1988). 

An indication of the importance of community variables is found in Zena 
Blau’s (1981) study of black and white children and their families. She 
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developed a measure of social milieu, representing exposure to persons of 
middle-class education or occupation. Her measure of milieu was second only 
to race as a predictor of scores on intelligence and achievement tests, serving 
as a better predictor than the family’s own educational and occupational 
status. The impact of middle-class exposure was especially strong among 
black families. 

In addition, in Bronfenbrenner’s (1979) terms, most studies of familial 
influences on adolescent academic achievement have focused on the ‘micro- 
system’ (e.g. the family’s influence on the adolescent’s personality) or the 
‘exosystem’ (the influence of the social structure within which the adolescent 
lives) rather than on the ‘mesosystem’ (the concrete linkage between experi- 
ences in the home environment and experiences in the school). Leichter 
(1974) has noted that the process by which parental behaviors are linked to 
educational effects is usually vague and implicit, rather than explicitly 
observed. One exception is the pioneering work of Epstein & McPartland 
(1977), who studied the behavior of the child in the home and its relation to 
school performance. 

Lightfoot (1978) reported that, despite the profound impact of family life 
on school processes and the academic success of children, family-school 
relationships were largely ignored by researchers. Those sociologists who 
sought to study the relations between the family and other socializing in- 
stitutions emphasized the organization of each social system rather than their 
dynamic intersection. For example, we know that children who are ex- 
ceptionally resistive at home are likely to have poor relationships with teach- 
ers and peers and school, and to do poor academic work (Patterson 1982). 
But, as Maccoby and Martin (1983) note, we do not know if the child’s 
defiant attitude at home makes an independent contribution to the problems at 
school. . 

With respect to ethnic contexts, Entwisle (in process) notes that we need ` 
more studies of the similarities: and differences among ethnic groups in the 
associations between family processes and adolescent school performance. 
Dornbusch et al (1987b) compared the relation of parenting styles to school 
outcomes across ethnic groups, but there are few studies of family processes 
at the high school level that study several ethnic groups simultaneously. 
Clark’s (1983) work on younger children shows general similarity of family 
processes that improve student performance for black, Hispanic, and non- 
Hispanic white students. Hess & Holloway (1984) note similarity among 
maternal variables in studies of math grades among black and caucasian 
students. 

Stevenson (1986) has done excellent comparisons of family processes 
within specific groups of Asian and American children. The patterns that he 
has observed are provocative and suggest additional research, such as study- 
ing the relation of collective orientations among American adolescents to their 
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levels of deviance and school performance. A study of Japanese education 
(Leestma et al 1987) indicated that similar parenting strategies may be 
effective in a variety of contexts. We are likely to find greater group differ- 
ences in the frequency of a particular family behavior than in the relation 
between that behavior and some adolescent outcome. 

Similarly, we need to understand gender differences as they relate to family 
functioning (Hill & Holmbeck 1987, Montemayor & Brownlee 1987). For 
example, we know that early-developing males come to have higher educa- 
tional expectations for themselves, and their parents also have higher ex- 
pectations for them. Yet no such process is found among early-developing 
females (Duke et al 1982). Gender differences are also noted in attributional 
processes, with females much more likely to believe that the cause of a poor 
performance in mathematics. is their lack of ability (Dornbusch 1974, Eccles 
et al 1983). Although ability is more important than socioeconomic status as a 
predictor of educational performance for males, the reverse is true for females 
(Alexander & Eckland, 1975). We need to examine the family and societal 
processes that produce such gender-specific findings. 

Our knowledge of gender interactions during adolescent development is 
surprisingly sparse (Steinberg 1987). Although there are gender differences in 
the development of autonomy, there are few differences in the activities of 
sons and daughters with their parents (Montemayor & Brownlee 1987), or in 
the extent of disagreements on rules for sons and daughters (Hill & Holmbeck 
1987). Detailed analyses indicate the importance of differentiating among the 
four different dyadic relationships in the family: mother-daughter, mother- 
son,.father-daughter, and father-son. The intensity of the mother-daughter 
relationship is greatest, and the father-daughter relationship is low on emo- 
tional involvement (Steinberg 1987). We need more studies of gender differ- 
ences and similarities in family relations and interaction as they are trans- 
formed during adolescence. 

With respect to social class, less advantaged parents value education at 
least as much as those of higher status, yet they-often do not act so as to 
promote academic effort by adolescents. Perhaps parents do not know the: 
kind of effort expected or how much effort the student is actually expending. 
This lack of parental demand for effort is linked to a tendency for many 
well-intentioned teachers to expect less effort and performance from dis- 
advantaged adolescents (Natriello and Dornbusch, 1984). The result is a form 
of institutionalized racism, a racism without racists (Massey et al 1975). 


FAMILY BEHAVIORS 


Let us now turn to specific behaviors within the family, examining their 
relationship to adolescent behaviors and outcomes. Usually the studies report 


SOCIOLOGY OF ADOLESCENCE 243 


cross-sectional correlations, so that the direction of causation is unknown. We 
know that the behaviors of the adolescent often influence the behaviors of the 
parents (Stolz 1967, Maccoby & Martin 1983), so we must suspend judgment 
on the direction of causation. Yet the pattern of findings is suggestive of an 
underlying typology of parent-child relations (Baumrind 1971, 1973) that we 
soon discuss. 

With respect to verbal communication in the home, most studies are of 
young children. The studies summarized by Hess & Holloway (1984) hint at 
relations that persist through adolescence. For example, better school per- 
formance was found among students who participate in mealtime con- 
versations, and among those who were more often asked for information by 
their parents. 

Chaffee and his colleagues (Chaffee et al 1971, Chaffee 1978) emphasized 
styles of communication within the home. They differentiated between two 
general styles of communication. One style (social) emphasizes conformity to 
the opinions of others, even to the extent of subordinating one’s own views to 
the views of other persons. A second style of communication (content) 
emphasizes concern for the content of ideas, a willingness or eagerness to 
consider seriously the diversity of ideas held by people. Kohn (1977) viewed 
such. differences in emphases as- powerful: forces that tend to perpetuate 
inequality in the system of stratification. Those parents whose occupations 
place them in a subordinate role, with obedience expected as part of the job, 
emphasize conformity more and self-direction less than do parents whose 
occupations provide more opportunities for self-direction. 

Across a wide variety of social groupings, there is a general tendency for 
emphasis on conformity to be associated with low grades in high school, and 
for an emphasis on diversity of content to be associated with high grades. 
Non-Hispanic white families tend to emphasize conformity less than do the 
other ethnic groups, while families with a higher level of parental education 
emphasize diversity more and conformity less than do parents with less 
education. Despite these group differences in the frequency of different forms 
of communication, the relation of communication style to school performance 
is similar in all social groups (Dornbusch & Wood, 1988). 

Epstein & McPartland (1977) believe that an open style of communication 
at home increases the likelihood that children will interact successfully with 
teachers, fellow students, and others from whom they can learn. These social 
skills should increase students’ positive attitudes towards learning and im- 
prove the quality of their school experiences (Epstein 1981). 

Another important process is family decision-making, which Dornbusch et 
al (1985) showed to be related to adolescent deviance. Part of the explanation 
for the high rate of adolescent deviance -in single-parent families is their 
propensity to permit adolescent youths earlier contro] over.their own behavior 
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in such decision-making areas as choice of friends, choice of clothes, spend- 
ing money, and time they must come home. Permitting such early decision- 
making by the youth alone is an understandable response by an overloaded 
single parent. Across all social classes, whatever its expediency, too early 
autonomy appears to have an unfortunate consequence—an increased proba- 
bility that the youth will engage in deviant acts. 

In a national sample of American youth, the higher rate of deviance among 
children in single-parent families is associated with lower performance on a 
number of measures of school performance (Dornbusch & Wood 1988). 
Controlling for social class, children in single-parent households were not 
doing worse on IQ tests or achievement tests; yet those students showed a 
tendency to be rated lower by their teachers on intellectual ability and 
performance than were students from two-natural-parent households. The 
lower ratings from their teachers received by adolescents from single-parent 
homes were associated with the level of adolescent deviance in those house- 
holds. A different sample of adolescents replicated the finding that single- 
parent households were more likely to give early autonomy, and that early 
autonomy was associated with lower school performance (Dornbusch & Gray 
1988). 

Regardless of the level of parental education, higher grades are consistently 
associated with joint decision-making, which involves both the parents and 
the adolescent. Lower grades are somewhat more likely in families in which 
the parents dominate the decision-making process (Dornbusch & Gray 1988). 
The relation between joint decision-making and educational performance is 
consistent with results reported by Epstein & McPartland (1977), who found 
that greater participation by the student in family decision-making was associ- 
ated with more positive personality development and better coping skills in 
school. 

As the adolescent gets older, parental domination of decision-making goes 
down and youthful autonomy increases. Joint participation of parents and 
adolescent in decision-making is highest in the middle years of adolescence 
and then declines. Most youths are eventually going to leave the parental 
household, and parents will lose much of their influence. Joint decision- 
making appears to be a way station prior to the youth’s assuming almost 
complete autonomy. Since joint decision-making is more frequent among 
middle-class adolescents, this form of decision-making behavior might ex- 
plain some of the oft-reported association between social class and measures 
of deviance and school performance. 

Another family process is parental monitoring of adolescent behavior. 
Pulkkinen (1982) assessed Finnish children through teacher ratings, peer 
nominations, and self-reports at the ages of 8, 14, and 20 years, with the focus 
on aspects of self-control. At the 14-year assessment, in-depth interviews 
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were conducted with a sample of parents. “Child-centered” parents, who 
respected the child as an individual and planfully supported the child’s 
development, did a better job of monitoring the whereabouts and activities of 
the adolescent than did those who used “parent-centered” child rearing, which 
was more short-sighted, inconsistent, and dependent on the parental mood. 
The children who were products of parent-centered childrearing were im- 
pulsive, uninterested in school, and likely to be truant. 

In the United States, Patterson (1976; 1982) has studied boys at high risk 
for delinquency. Like Pulkkinen, Patterson finds that the failure to monitor 
where the children are and what they are doing is a factor carrying particularly 
strong risk of subsequent delinquent behavior in the children. Dornbusch 
(1987) has found that American Catholic adolescents are more likely to make 
choices that their parents do not endorse if they believe that their parents are 
unlikely to find out their choice. The peer groups in which American adoles- 
cents participate are not perceived as supporting the values of the parental 
generation (cf Bronfenbrenner 1970). A situation-ethics approach seems to be 
typical of American youth, and there is little disposition to act on the basis of 
internalized standards when significant others are unlikely to judge on the 
basis of those standards. Finally, Steinberg (1986) found that those latchkey 
children whose parents knew their whereabouts were as resistant to peer 
pressure as those who were directly supervised by an adult. 

Although close surveillance can force adolescents to do what immediately 
needs to be done, such a level of oversight cannot be sustained throughout the 
days and years. While parental monitoring of adolescents usually has positive 
results, too close surveillance often boomerangs. Work in social psychology 
by Lepper and his associates (Lepper et al 1973, Lepper & Greene 1978) 
indicates that there is a “cost” to rewards. Persons receiving external rewards 
tend to perceive outside forces as the causes of their behavior, and this often 
reduces the importance that they attribute to their internal motivations. 

This attributional model, based on the individual’s perception of the causes 
of behavior, has been applied to adolescents, as well as to children and young 
adults. In the largest field study of these laboratory-based ideas (S. Dorn- 
busch, J. T. Elworth, and P. L. Ritter, in process), both positive extrinsic 
reinforcement and negative extrinsic reinforcement are correlated with lower 
high school grades and further decline in grades. Parents’ expressing negative 
emotions and being uninvolved are both associated with lower and declining 
grades. Only parental encouragement, praise, and offers to help (note that 
“offers to help” leaves control in the hands of the student) appear to be 
associated with higher and improving grades. 

The expected relations between grades and parental reactions are generally 
present in each ethnic group, type of family structure, and parental education 
group, for each of the two sexes. It appears that, just as monitoring is 
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preferable to close surveillance of adolescents, low-key encouragement tends 
to be better than the use of direct and often heavy-handed rewards and 
punishments. Extrinsic rewards or punishments, by providing an external 
reason for behavior, tend to reduce the internal motivation of an adolescent. 
In a world in which continual surveillance of the adolescent is impossible, 
parenting techniques that are more likely to produce internal motivation are 
more likely to be effective. 

Baumrind showed, in a series of studies of parenting techniques (Baumrind 
1973, Baumrind & Black 1967), that the “authoritative” pattern of child 
rearing, characterized by a combination of warmth, demandingness, and 
verbal give-and-take, is superior to “authoritarian” or “permissive” parenting 
styles. Other researchers have found powerful empirical support for the utility 
of Baumrind’s typology of parenting styles. Adolescents from authoritative 
homes do better on psychosocial development (Hill 1980, Maccoby & Martin 
1983). Dornbusch et al (1987b), using a larger and more diverse sample of 
adolescents, found that, controlling for the level of parental education, both 
authoritarian and permissive parenting styles were negatively associated with 
grades, and authoritative parenting was positively associated with grades. 

The size and diversity of that sample, however, also permitted carrying out 
separate analyses for four ethnic groups: Black, Asian, Hispanic, and non- 
Hispanic white. The expected relationships clearly held for non-Hispanic 
whites and blacks, but not as clearly for Asians or Hispanics. Asians, in 
particular, showed a discrepant pattern; only authoritarian parenting showed 
the-expected negative correlation to grades. However, the frequency of use of 
particular parenting styles indicated that Asian parents used authoritarian 
parenting more and authoritative parenting less than did non-Hispanic whites. 
Yet Asians as a group received high grades in school. We obviously must 
move beyond typologies derived from a single ethnic group if we are to 
understand the success of Asian adolescents. We need careful studies of the 
meaning of specific behaviors as interpreted by members of various social 
groups, particularly ethnic groups, if we are to produce major advances in our 
knowledge. For example, authoritarian parenting seems to have less negative 
impact on adolescents in first-generation immigrant families. Those adoles- 
cents are likely to perceive authoritarian parenting as more natural and less 
un-American, with parents exercising their responsibilities and children per- 
forming their filial duties (Dornbusch & Wood 1988). 

Nevertheless, it is impressive that authoritative parenting is so often associ- 
ated with positive outcomes across age groups, social classes, ethnic groups, 
and types of family structure. Clark’s (1983) study of successful parenting 
practices among poor black families produced findings that are consonant 
with the findings concerning authoritative parenting. In the family experience 
of successful poor black children, Clark found more frequent dialogues 
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between parents and children, parental encouragement of academic pursuits, 
clear and consistent limits for children’s behavior, warm and nurturing in- 
teractions with parents, and consistent monitoring of how children spend their 
time. 

A major deficiency of all these studies of parenting style among adolescents 
is that they are cross-sectional. Steinberg et al (in press) did a longitudinal 
study and corroborated the previous cross-sectional findings. Their results call 
into question the view that parental authoritativeness merely follows from, 
rather than precedes, the development of competence in children. Moreover, 
they controlled for the effects of achievement test scores, prior school grades 
and attendance, and socioeconomic status. We can conclude that the relation 
between the measures of parent-adolescent relations and school performance 
is not an artifact. In addition, the pattern of sex differences they observed 
corroborates a well-documented finding: the negative consequences of ex- 
cessive permissiveness among adolescents are greater for males, whereas the 
risks of excessive control are greater for females (Bronfenbrenner 1961). 

The authoritative style may represent an ideological commitment that does 
not readily change as children grow up. While authoritarian parenting de- 
creased with the age of the adolescent and permissive parenting increased 
with age, there was no relation between the age of the adolescent and the level 
of authoritative parenting (Dombusch et al 1987b). Since families with higher 
parental education tend to be somewhat lower in authoritarian and permissive 
parenting and higher on authoritative parenting, this pattern helps to explain 
some of the advantages of adolescents from higher social class levels. Given 
the power of authoritative parenting, it is noteworthy that we know almost 
nothing of the behaviors and situations that are determinants of authoritative 
parenting. 

Parenting style is a global concept, including ideas about communication 
patterns and family decision-making. It is worth noting that educational 
performance was lowest among students whose family parenting style was 
inconsistent, mirroring the finding we reported earlier, that inconsistent com- 
munications were associated with low grades. Similarly, single mothers 
showed a higher level of permissive parenting than did two natural parents, 
echoing the finding that single parents were likely to give too early autonomy. 

Stepfamilies, compared to families with two natural parents, tended to be 
more authoritarian and more permissive. Given the lack of societal norms for 
the interaction between stepparent and stepchild (Crosbie-Burnett et al 1988), 
it appears likely that stepparents either attempt to exercise heavy-handed 
control (authoritarian) or retreat from confrontations with the adolescent, 
creating a proto-single-parent household. Thus, the analysis of parenting 
styles appears to give a behavioral basis for oft-noted relations between family 
structure and adolescent competence and deviance. We need many more 
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studies that relate family behaviors to family structures if we are to develop 
theories to explain adolescent performances and deviance. 


PEERS AND THEIR INFLUENCE 


In choosing friends, adolescents tend to pick those who are like them. Except 
for romantic attachments, peer groups tend to be of the same gender. Friends 
tend to be chosen from the same social class, race, and ethnic group and from 
those who are approximately of the same age. This homophily, like liking 
like, extends to the realm of values and behavior, producing complications in 
the analysis of peer influence (Savin-Williams & Berndt, in process). 

If one looks at any. behavior or attitude and compares adolescents with their 
friends, there-appears to be powerful influence from peers. The similarity in 
attitudes or behavior is, however, just as likely to be a product of selection as 
it is an indicator of peer influence. Longitudinal studies show about equal 
impact of peer influence and selection on the basis of homophily (Cohen 
1977, Kandel 1978, Sheppard et al 1987). Since most discussions of peer 
groups and their influence are not based on longitudinal data, it is important to 
recognize that the power of peer groups to influence their members is often 
overestimated. 

In mapping the social world of adolescents (Blyth et al 1982), gender 
differences are noteworthy. The nature of friendship is different in the two 
sexes. Males are more likely to spend time with groups of males, while 
females are more likely to interact with a single other female. This pattern is 
related to the degree of emotional intimacy among friends. Males are less 
likely to be emotionally close to their friends than are females. Among males, 
there is a greater emphasis upon activities. These gender differences in the 
ways that adolescents interact with same sex age-mates appear relatively 
stable cross-culturally and across species (Hartup 1983, Hinde 1984, Savin- 
Williams 1987, Blyth & Foster-Clark, in press). 

Friendships with peers appear to provide social support that moderates the 
impact of stressful life events (Barrera 1981, Greenberg et al 1983). A small 
number of friends is the usual source of social support and influence from 
peers, just as closer friends are the basis among peers for the support of 
self-esteem (Johnson 1986). In discussing relations with peers, it is necessary 
to make distinctions among intimate friends, a larger group of friends or 
acquaintances, and a “crowd” (Brown et al 1986b). 

The character of peer culture among adolescents is not well charted. In part 
this is based on the profound influence of Coleman’s (1961) work on adoles- 
cent society. Instead of a single dominant peer culture among American 
adolescents, researchers have found multiple peer cultures that support the 
diversity of adolescent values and behavior (Brown, in process). In addition, 


SOCIOLOGY OF ADOLESCENCE 249 


where Coleman viewed adolescents as forming a set of central values opposed 
to that of adult society, evidence consistently suggests congruence of adoles- 
cent and parental values (Elkin & Westley 1955, Douvan & Adelson 1966, 
Kandel & Lesser 1972, Youniss & Smollar 1985). Indeed, the variation in 
values among adolescents was as great as the variation between adults and 
youths (Yankelovich & Clark 1974). The values of peers more often reinforce 
than oppose the values of parents (Bandura 1964). 

Even when forced to choose in hypothetical situations in which parental 
and peer standards were opposed (Berndt 1979), adolescents were careful in 
their choice of reference group. Peer pressure was more powerful when the 
choice related to peer relations, but adolescents tended to follow parental 
standards on more important issues (Brittain 1963, Larson 1972, Sebald & 
White 1980). Adolescents tended to see parents as the more important evalua- 
tors of their behavior, following peer pressure only when parents would not be 
likely to find out which choice they had made (Dornbusch 1987). 

In the world of the adolescent, the direction of peer influences is as likely to 
be positive as negative. Peers are more likely to reinforce existing attitudes 
than to produce novel forms of behavior. The adolescent is an active partici- 
pant in the selection of peer groups—there is a reciprocal relation between the 
qualities of the group and the qualities of the adolescent (Newcomb et al 
1986). 

For example, in acquiring or developing a close friendship, an adolescent 
girl’s choice will be influenced by whether or not she is a virgin; if she is, then 
she is more likely to choose another virgin as her friend. If the friend then 
loses her virginity, the girl often must choose to terminate the friendship or to 
lose her own virginity (Billy et al 1984, Billy & Udry 1985). It is interesting 
that the perception that their peers are.engaging in sexual intercourse is an 
incentive for sexual relations, even when the perception is false (Cvetkovich 
& Grote 1980). Even false beliefs have consequences. 

The sensitivity of the adolescent to evaluation by peers, parents, teachers, 
and others is increased by the combination of egocentricity and the developing 
ability to take into account the viewpoints of others (Keating, in process). 
This adolescent egocentricity (Elkind 1967, Shantz 1983, Lapsley et al 1986) 
leads to heightened introspection, with a tendency for adolescents to see 
themselves as the focus of the thoughts of others. They feel themselves the 
center of attention, and this affects various aspects of their behavior (Mechan- 
ic 1983). For example, adolescents are like adults in that about half of each 
group does not comply with a medical regimen. But the reasons for noncom- 
pliance differ. Any therapy which causes the adolescent to feel that he or she 
looks different from the peer group or less attractive may be rejected, even 
when the therapy is potentially life-saving (Klein et al 1984, Litt, in process). 

There are ethnic and gender differences in susceptibility to peer pressure. 
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With respect to sexual behavior, white adolescents seem more likely than 
blacks to be influenced: and to choose their friends on the: basis of the sexual 
behavior of peers. Adolescent females are more likely than males to be 
influenced by the perceived sexual behavior of peers (Billy & Udry 1983, 
Cvetkovich & Grote 1980). 

Adolescents who have close relationships with their parents show lower 
levels of sexual activity (Jessor & Jessor 1975, Inazu & Fox 1980). Not only 
is susceptibility to peer influence lower when the adolescent is close to the 
parent, but adolescents from high-risk families who face negative outcomes 
are more resilient when parents exercise closer supervision of their activities 
(Savin-Williams & Berndt, in process). In general, low status members of an 
adolescent group are more influenced by peers than are high status members 
(Savin-Williams 1987). 

Steinberg (in process) has integrated the findings of his own research 
(Steinberg & Silverberg 1986, Steinberg 1986) and that of others (Dornbusch 
et al 1985, Patterson 1982) to suggest that the transition toward healthy 
independence in the peer group is facilitated by autonomy from parents in the 
emotional sense, but not in the behavioral sense. Parents must help their 
adolescents to be released intrapsychically without removing behavioral con- 
straints. This perspective on the development of adolescent autonomy is 
compatible with our discussions of monitoring and parental permissiveness. 


DEVIANT AND CONFORMING BEHAVIORS 


Since most adolescents deviate from the prevailing societal norms at some 
time in one or more ways, deviation and conformity are relative terms. There 
are certain forms of deviance that are likely to have long-term consequences, 
such as adolescent pregnancy, and there are patterns of behavior that have a 
high probability of continuing into adulthood, such as committing multiple 
criminal acts. There is disagreement among researchers as to whether there is 
a single underlying tendency toward deviance (Jessor & Jessor 1977, Kaplan 
1980, Robins 1978), or whether some people engage in many forms of 
deviance and others are deviant only in a specific area (Bachman 1987, 
McCord 1983). In this section, we consider only some aspects of a few forms 
of deviation: adolescent sexuality, adolescent pregnancy, smoking and drug 
use, and multiple delinquent acts. 

The discussion of deviance among adolescents i is complicated by “status 
offenses.” Some behavior, such as sexual- activity and drinking alcohol, that is 
considered appropriate at an older age is viewed as deviant and illegal when 
carried out by young adolescents. The task of defining deviance is further 
complicated by the widespread adoption of patterns of behavior that are 
supposedly negatively evaluated by the wider society. If, as we note, some 
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“deviant” activities are engaged in by the majority of American youths, the 
study of deviance cannot be analyzed without regard to changing social 
trends. 

To illustrate, sexual behavior is not just the acting out of a preordained 
biological imperative. Social processes shape sexual behavior within a par- 
ticular setting. There is diversity in sexual behavior within the subgroups of 
our society, as well as cross-culturally (Ford & Beach 1951). Sexual behavior 
among adolescents is learned, and it responds to various social pressures and 
processes. 

The greater a mother’s own sexual experience as an adolescent, the greater 
the sexual activity of her adolescent daughter (Newcomer et al 1983). Family 
structure also affects sexual behavior; adolescent females from single-parent 
homes become sexually active at a younger age than those who grow up in 
two-parent families (Zelnik et al 1981, Inazu & Fox 1980). The larger the 
number of siblings in the family, the greater is the probability of a sexually 
active older sibling serving as a role model (Hogan & Kitagawa 1983). 

The schools now play a major role in sex education; indeed they are the 
primary source of sexual information. Sex education programs do not seem to 
affect the level of sexual activity among adolescents (Kirby 1984). But sex 
education does reduce rates of pregnancy among sexually active youngsters 
(Furstenberg & Brooks-Gunn 1985). 

Adolescents have recently become sexually active in greater numbers and 
at younger ages (Zelnik & Kantner 1977). By age 15, about 17% of boys and 
6% of girls have had sexual intercourse; at age 18, 67% of boys and 44% of 
girls; by their 20th year, the figures have gone up to 80% for males and 70% 
for females (H. Katchadourian, in process). 

Black adolescents become sexually active earlier than white adolescents. 
Billy (Billy et al 1984, Billy & Udry 1985) reports that the following 
percentages of 11 to 17 year-olds have had sexual intercourse: black males, 
77.3%; black females, 39.1%; white males 24.9%; and white females, 
10.3%. Frequency of religious observance is associated with lower levels of 
sexual intercourse (Inazu & Fox 1980). Jessor & Jessor (1977) successfully 
predicted which adolescents of both sexes would become sexually active in 
the next year. They tended to place a higher value on independence, valued 
academic achievement less, were lower in religiosity, and more tolerant of 
deviance. 

Litt (in process) notes that fewer than half of adolescents use a con- 
traceptive during first intercourse. The time between beginning to have 
intercourse and using a contraceptive is about a year. Perhaps they do not 
subjectively understand the risk they are taking, or perhaps they feel invulner- 
able. Therefore, most adolescent females become pregnant within six months 
after beginning sexual activity. 
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The age at first marriage of American women has risen to an all-time high. 
Furstenberg (in process) reports that in 1986 just 15% of all 19 years olds had 
ever married, compared to 40% in 1960. The postponement of marriage has 
contributed to the growing acceptability of premarital sexual activity during 
adolescence. More adolescents are thus exposed to the risk of pregnancy 
outside of marriage (Zelnik et al 1981, Furstenberg et al 1981). Furstenberg 
(in process) notes the result: Adolescent parenthood is not increasing, but 
there is an increase in the proportion of unwed teenage mothers. Only 13% of 
all births in 1985 were to adolescents, but three fifths of them were to those 
unmarried. 

Walker (1988) has described the post-World-War-II erosion of marriage 
among blacks. The difference between black and white rates of marital 
instability was small until 1950, but now there is a sharp divergence. He 
attributes the shift in the changing marriage market among blacks with many 
black women unable to find a desirable mate. Their choice of unwed mother- 
hood is thus not a return to earlier values, but is based on the current social 
and economic disadvantages of black men (Gibbs 1988). 

Now, turning to antisocial behavior that is deviant regardless of age, we 
find that such behavior increases dramatically in adolescence and is dis- 
turbingly prevalent in that age group. Responses on the National Youth 
Survey suggest that official arrest figures grossly underestimate the extent of 
delinquency. About 30% of the males and 10% of the females were classified 
as serious violent offenders who reported at least three violent offenses within 
a year (Elliott et al 1985). 

About 31% of males and 28% of females ages 12—17 years reported that 
they had used illicit drugs. The proportions using’ drugs were higher for 
non-Hispanic whites than for either blacks or Hispanics (McCord, in process). 
There is a connection among these deviant acts. According to the National 
Youth Survey, violent offenders were also likely to be involved in felony 
thefts (48%), drug sales (22%), and drug use (20%) (Elliott et al 1985). 

Similarities with peers are often found in studies of delinquents and sub- 
stance abusers. As previously noted, such similarities cannot be used to 
support the influence of peers. But McCord (in process) believes that two 
longitudinal studies do provide impressive evidence of peer influence. Having 
delinquent friends predicted a first arrest for an individual within the follow- 
ing three years (Snyder et al 1986). Among adolescents who had not smoked 
marijuana, those having marijuana-smoking friends were more likely to begin 
using marijuana in the next year (Jessor & Jessor 1977). Finally, Litt (in 
process) reports that, although the presence of smokers within the family 
contributes to the adolescent’s adoption of cigarette smoking, the most power- 
ful influence is pressure from a best friend to smoke. 

School achievement may reduce the likelihood of drug use (Brook et al 
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1986) as well as delinquency (Farrington 1986, Rutter 1978). Although it is 
hard to separate cause and effect, McCord (in process) believes that those not 
valuing success in school may be likely to reduce their academic investment 
before becoming deviant. Adolescents who drop out of high school have 
already been truant in elementary school, while those adolescents who are 
about to try marijuana denigrate the importance of academic achievement 
(Jessor & Jessor 1977). | 

‘Classical sociological theories explain deviant behavior among members of 
the lower class as a response to their lack of legitimate opportunities for 
success (Cohen 1955, Merton 1957, Cloward & Ohlin 1960). Differences in 
rates of delinquency and crime among ethnic and social class groups support 
the view that deviance is often associated with differential life chances.. But 
some of the differences in-offical crime rates may be a product of differential 
punishment. Black youths are more likely to be charged for an offense than 
are white youths who commit the same offense. 

Reviewing empirical studies that reflect this classical approach, McCord 
(in process) is concerned that increasing economic opportunities for dis- 
advantaged groups has not led to declines in delinquency. The lack of 
successful interventions leads to pessimistic conclusions about the utility of 
this approach. Indeed, our lack of fundamental knowledge on which to base 
interventions is indicated by the numerous failures to reduce delinquency, 
ranging from incarceration to education to job programs to counseling. She 
reminds us of the famous failure of the Cambridge-Somerville Youth Study, 
which attempted to prevent delinquency by providing counseling and other 
assistance over an extended period. Those in the treatment group were more 
likely to be recidivist criminals, mentally ill, or alcoholics than those in the 
control group (McCord, 1978, 1981). 

Wilson (1987) agrees with McCord as to the lack of effectiveness of most 
interventions to reduce delinquency. The sole bright spots in his summary of 
the literature are the work of the Oregon Social Learning Laboratory in which 
behavioral therapy techniques are used to train parents and their problem 
children (Patterson 1976, 1982, 1986; Patterson et al 1982), and various Head 
Start programs. He urges large-scale studies of such interventions among the 
highest-risk families and children. Such studies will add much to the needed 
knowledge base on which to build programs for the reduction of adolescent 
delinquency. 


CONCLUSION 


Several general conclusions are suggested by this brief summary of some 
topics in the recent research on adolescence. First, the study of social pro- 
cesses, linked to research on status groups, holds the greatest promise for the 


254 DORNBUSCH 


development of usable knowledge. Second, there is a need for more sociolo- 
gists to enter this field, given the wide acceptance of sociological perspectives 
among investigators from various disciplines. Finally, as research results 
accumulate, the time is ripe for integrative theoretical efforts that will prevent 
raw empiricism from dominating the study of adolescence. 
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Abstract 


Improvements in the living standards of the elderly over the past 25 years 
have been dramatic. The rising affluence of the elderly has prompted a debate 
over what constitutes an equitable distribution of resources between genera- 
tions. The purpose of this essay is to review the evidence on the changing 
well-being of the elderly in the United States. While successive cohorts of 
elderly have attained higher levels of economic well-being, a review of the 
longitudinal evidence also shows that some elderly individuals, especially 
women and the chronically ill, remain economically vulnerable as they age. 
Whether Social Security and other public expenditures that benefit the elderly 
are equitable is a key issue in the debate over whether the economic position 
of the elderly is “fair”: We present two frameworks, a “pay-as-you-go” model 
and a pension model, within which equity issues are discussed. The central 
role played by health concerns in assessing the well“being of the elderly leads 
us also to review evidence on two important health issues: recent trends in the 
physical frailty of the elderly, and the likely burdens, both to the elderly 
themselves and to society at large, of providing medical care for future’ 
cohorts of elderly. 
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INTRODUCTION 


In the early 1960s, when one in three elderly men and two in five elderly 
women were poor, Gunnar Myrdal wrote of the “terrifying extent to which old 
people are left in poverty and destitution” in the United States. He thought 
that “it cannot possibly be the considered opinion of the majority of Amer- 
icans that so many . . . (elderly) should be left in misery, squalor and often 
forbidding loneliness, unattended though they are in deed of care” (1963, as 
quoted in ‘Schultz 1988, p. 17). 

Improvements in the living standards of the elderly over the next quarter 
century could hardly have been more dramatic. By the mid-1980s, the 
incidence of poverty among the elderly had fallen to one in eight, broad 
medical care was being provided through the Medicare system, and earlier 
retirement meant that the elderly had more leisure time in which to enjoy their 
higher incomes. 

The rising affluence of the elderly has prompted a debate over what 
constitutes an equitable distribution of resources between generations. Great 
publicity has been accorded the unprecedented situation in which the living 
standard of the typical elderly person—improved in large part by more 
generous Social Security retirement and medical benefits—is now higher than 
that of the typical child, for whom transfers have become less generous 
(Preston 1984, Binstock 1983). 

Two kinds of health concerns darken the otherwise bright picture of the 
well-being of the elderly. First, have medical advances actually increased the 
frailty of the typical elderly person by prolonging the lives of disabled persons 
who would otherwise have died? A second, economic, concern is to what 
extent costs of long-term care for disabled elderly, currently not covered by 
public health insurance programs, threaten the economic well-being of the 
elderly and their children. 

The purpose of this essay is to review the evidence on the changing 
well-being of the elderly. We begin with a review of cross-sectional data 
showing strong economic gains for the elderly, relative to other groups in the 
population, especially in the last two decades. Rising Social Security benefits 
and, to a lesser extent, income from assets account for much of the increase; 
these have more than offset falling earned income due to earlier retirement. 
But while successive cohorts of elderly have attained higher levels of eco- 
nomic well-being, a review of longitudinal evidence also shows that some 
elderly individuals, especially women, remain economically vulnerable as 
they age. 

Few would deny the elderly their income earned from savings, private 
pensions, or work. But whether Social Security and other public expenditures 
that benefit the elderly are equitable is a key issue in the debate over whether 
the economic position of the elderly is “fair.” We present two. frameworks 
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Figure I Average living standards of elderly and nonelderly individuals, by sex, 1949-1979. 
(Source: Ross et al 1987, Tablet) 


within which equity issues surrounding Social Security expenditures can be 
discussed. 

The central role played by health concerns in assessing the well-being of 
the elderly leads us also to review evidence on the: two important health 
issues: recent trends in the frailty of the elderly person, and the likely burdens, 
both to the elderly themselves and to society at large, of providing medical 
care for future cohorts of elderly. 


PATTERNS OF CHANGE IN THE ECONOMIC STATUS 
OF THE ELDERLY 


Ross et al (1987) track the relative economic position of elderly and nonelder- 
ly men and women using data from the 1950 to 1980 Decennial Censuses 
(Figure 1). Their measure of economic status—the income-to-needs ratio— 
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adjusts for family size differences by dividing each adult’s total household 
income by the poverty threshold relevant for his ‘or her household’s composi- 
tion and then adjusts for changes in the cost of living. (The poverty lines for 
households of one and four persons in 1988 were about $6,000 and $12,000, 
respectively). A ratio of 1.0 would mean that the average person in the group 
had a family income that was equal to the poverty line. 

Figure 1 shows a striking improvement for all groups. Income for the 
average elderly household during the post-war period was near or below the 
poverty line but by 1979 it had increased to nearly three times the poverty 
level. The absolute improvement in the living standards of the elderly over the 
three decades—roughly 2.0 income-to-needs units—was matched by 
nonelderly men and women; in relative terms, however, elderly men and 
women fared much better than did younger people. The differential improve- 
ment was particularly striking between 1969 and 1979, when the average 
living standard improved significantly for the elderly but barely kept up with 
inflation for nonelderly men and women. If anything, the elderly have 
continued to gain relative to the nonelderly during the 1980s.! 

Despite the differential improvement, the comparisons of income-to-needs 
positions in Figure 1 still show that the economic status of the average elderly 
person was below that of the average nonelderly adult in 1979. There are 
several reasons to suspect that additional adjustment to the income measure 
would eliminate or even reverse these differences. Danziger et al (1984) find 
that adjustments for the consumption benefits of owning homes and tax 
advantages have a measurable positive effect on the relative economic status 
of the elderly; further adjustments for the monetary value of medical and other 
in-kind benefits (Smeeding 1982), the apparent underreporting of income by 
the elderly (Radner 1982), and the value of their greater leisure time would 
only add to the relative position of the elderly. Important adjustments that 
might reduce the relative status of the elderly are for ill health, the dimensions 
of which are explored in a subsequent section of this chapter, and fringe 
benefits to the working-age population. On balance, however, there appears 
to be little doubt that the economic status of the typical elderly person, when 
measured in a fairly comprehensive way, probably meets or exceeds that of 
the typical nonelderly individual in the 1980s. 

Statements about the “typical” individual within a group often conceal 
important aspects of ‘the distribution of experiences, especially among the 
least well-off members (Quinn 1985). Evidence on time trends shows that, if 
anything, the economic status of low-income elderly has improved faster than 


‘Radner’s analysis (1987, Table 7) of income changes between 1967 and 1984 using data from 
the Current Population Surveys showed that between 1979 and 1984 the average annual percent- 
age change in real income for family units headed by an elderly person was 3.4 percent, while it 
was minus .4 percent for family units headed by a nonelderly person. 
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that of the average elderly person over the past several decades.” The in- 
cidence of poverty among elderly men fell from 59.4% in 1949 to 8.5% in 
1986 and among elderly women over the same period from 75.7% to 15.2%.? 
Placing a value on in-kind medical benefits and some of the other factors 
mentioned above would almost certainly accentuate these favorable trends 
(Smeeding 1986). 

But while the overall incidence of poverty among the elderly,has fallen to 
unprecedented levels, there are several reasons to view the economic situation 
of some elderly as a serious social problem. First, poverty among widows and 
members of minority groups is still quite prevalent: In 1986, poverty rates for 
individuals aged 65 and older by sex and ethnic group were as follows: white 
men, 6.9%; white women, 13.3%; black men, 24.2%; black women, 35.5%; 
Hispanic men, 18.8%; Hispanic women, 25.2%. Second, there appears to be 
a substantial concentration of elderly in the income range just above the 
Official poverty line who do not appear in the usual poverty statistics (Quinn 
1985). Third, the distribution of cash and in-kind benefits appears to leave the 
near-poor elderly especially vulnerable to economic or health problems 
(Smeeding 1986). And fourth, fluctuations in income and the possibility of 
costly medical expenses, described below, also place substantial numbers of 
nonpoor elderly at risk of economic hardship. 

In order to understand the issues surrounding the economic status of the 
elderly, it is important to distinguish between the experiences of different 
cohorts of elderly people and the experiences of a particular cohort as its 
members age. A tempting, but erroneous, interpretation of the rising afflu- 
ence of the elderly depicted in Figure 1 is that the economic environment of 
old age is quite favorable, with the household income of a typical elderly 
person rising with advancing old age. The problem, of course, is that com- 
parisons of same-aged elderly groups a decade apart confuse the experiences 
of the same individuals present at both times with the situations of a newly 
entering cohort of elderly and a departing cohort of elderly who did not 
survive the decade. 

Depicted in Figure 2 are the experiences of four five-year cohorts of 
individuals, as revealed by census data and reported in Ross et al (1987, Table 
3). For most cohorts, average household incomes rise with age from their 


*Radner (1987, Table 9) found that income inequality for family units headed by elderly 
individuals feil substantially between 1967 and 1979, and then rose slightly between 1979 and 
1984. The share of family income received by the bottom quintile of family units headed by 
elderly individuals rose from 4.6% in 1967 to 5.6% in 1984; comparable shares for the bottom 
quintile of nonelderly family units fell from 5.2 to 3.9%. 

The 1949 rates are reported in Ross et al (1987). The 1986 poverty rates and those presented 
below are based on Current Population Survey estimates as reported in US Special Committee on 
Aging (1987-1988), Table 2-7. 
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Figure 2 Average living standards for four cohorts of men and women. (Source: Ross et al 
1987, Table 3) 
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early 50s to mid-60s and then incomes fall as these cohorts enter their 70s. 
Thus, the economic environment of old age is not as favorable as one might 
infer from Figure 1. Also striking, though, is the greater affluence of the 
newer cohorts at every age: although somewhat reduced, the relative eco- 
nomic advantages enjoyed by more recent cohorts of elderly merely reflect the 
- advantages they enjoyed prior to retirement. Clearly it is important to distin- 
guish between aging and cohort factors. We do this first by reviewing what 
has been learned from longitudinal data on the dynamics of well-being in old 
age and then by examining cohort differences in economic status. 
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DYNAMICS OF WELL-BEING IN OLD AGE 


Tracking cohorts of elderly across time, based on census data as in Figure 2, 
is problematic. Not only may the selective nature of mortality lead to 
erroneous inferences about the income path of the typical individual, but also 
such tracking does not show the diversity of individual experiences. A still 
limited, but burgeoning, body of literature using several national longitudinal 
survey data sources has shown that the economic environment of old age is 
rather turbulent, much more threatening to women than men, and often 
reflecting a continuation of economic conditions prior to old age. 

Burkhauser & Duncan (1988) follow individuals who were aged 66 to 74 in 
1969 over the decade between 1969 and 1979, using data from the Panel 
Study of Income Dynamics. They find that while the average income-to- 
needs ratio of both the men and women in this group fell only modestly, a 
substantial minority (27%) within each group experienced at least one very 
substantial drop (i.e. of 50 percent or more) in economic status. One fifth of 
the men and more than one third of the women were poor at least once in the 
decade, while about one tenth of each group were poor more than half the 
time. They find that large changes in asset income, the death of a spouse, and 
other family composition changes involving grown children or other relatives 
figured prominently in producing the income fluctuations. 

Coe (1988) uses data from the Panel Study of Income Dynamics to compare 
the dynamics of poverty spells for elderly persons with those of the general 
population, as presented in Bane & Ellwood’s (1986) path-breaking article. 
He finds that the chances an elderly person will exit from poverty in the first, 
second, or third year of a spell are just as high as for the nonelderly, but that 
the exit probabilities fall substantially after the third year, producing an 
average poverty spell length for the elderly (5.3 years) that is considerably 
longer than for the nonelderly (4.2 years). That nearly two thirds of the 
elderly escape poverty spells within three years is a surprising and little- 
understood aspect of the economic well-being of old age. 

Burkhauser et al (1988) take a careful look at the economic consequences 
of retirement, the death of a spouse, and pension choice, using data from the 
Retirement History Survey (RHS) and family income-to-needs as a measure 
of economic status.* Their longitudinal data clearly show that the average 
economic position of both intact couples and eventual widows declines with 
time, from about 80% of preretirement income-to-needs immediately follow- 


The Retirement History Survey interviewed households headed by a person aged 58 to 63 in 
1969 and repeated the interviews at two-year intervals over the following ten-year period. 
Burkhauser et al (1988) restrict their analysis to couples who were nonpoor and intact at the time 
of retirement. 
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ing retirement to about 50% seven to eight years later. Not surprisingly, both 
the decline in average living standards and the risk of poverty were greater for 
widows than for intact couples. More than one fifth of the eventual widows 
were poor at least once in the eight years period following retirement, as 
compared with less than one tenth of the intact couples. 

A somewhat more controversial issue addressed by Burkhauser et al (1988) 
is the role played by the husband’s decision for or against a pension that 
provides spousal benefits after his death.” They find that pension choice has a 
substantial effect on the economic status of widows, although subsequent 
calculations using the same data by Holden & Burkhauser (1988) show that 
many of the economic advantages enjoyed by widows who receive spousal 
benefits were present prior to the death of their husbands. 


COHORT IMPROVEMENTS IN ECONOMIC 
WELL-BEING | 


There is little doubt that recent cohorts of elderly are much better off finan- 
cially than are older cohorts. A useful starting point for our discussion of the 
reasons for improvement is to examine the income packages of elderly 
families at various points in time. Figure 3 uses income data from the Current 
Population Surveys covering 1967, 1979, and 1984 to decompose incomes of 
family units headed by an elderly individual into four categories: earnings, 
Social Security, property, and all other income.® 

The average income of elderly family units rose by 55% between 1967 and 
1984, despite a large drop in the average amount of earned income. Although 
pay rates older workers can command in the labor market have risen sub- 
stantially over the past several decades, falling retirement ages have more 
than offset the higher earnings, producing the falling amount—and share—-of 


-Myers et al (1987) find that the majority of widows in the Retirement History Survey (RHS) 
lost all pension income when their husbands died. RHS data cover the period from 1969 to 1979, 
The Retirement Equity Act of 1984 required workers, beginning in 1986, to obtain their spouse’s 
signature if they opted for a single-life pension. This legislation may well increase the prevalence 
of pensions with spousal benefits; it does nothing for widows whose spouses had no pension and 
is therefore likely to have a rather limited effect on the poverty rates of older women (Myers et al 
1987). 

‘Data used in Figure 3 come from Radner (1987, Tables 12 and 16) and are expressed in 1982 
dollars. Income amounts are adjusted for family size using the poverty threshold adult equivalent 
scales and are normalized on one-person family units. For example, income amounts for units 
consisting of a single elderly individual are divided by .943; the division factor for units with two 
elderly persons is 1.190. Unrelated individuals are treated as one-person family units. The 
“Property” income category includes interest, dividends, rent and income from estates and trusts. 
The “Other” category includes public transfers such as Supplemental Security Income as well as 
public (non-Social Security) and private pensions and several other miscellaneous income 
sources. 
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Figure 3 Composition of income packages of family units headed by an elderly individual, 
1967, 1979, and 1984. (Source Ross et al, Table 3) 


earnings in the total family income package apparent in Figure 3. In 1967 
earned income accounted for 43% of average total income of elderly family 
units; by 1984 its share had fallen to 20%. 

In.contrast, Social Security benefits have gained in importance, doubling in 
average amount between 1967 and 1984 and increasing as an average share 
from 28% to 36%. By the late 1970s, Social Security benefits had become the 
dominant source in the income packages of the elderly. 

The wealth of the elderly grew in other ways as well, leading to a tripling of 
the average amount of rent, dividend, and interest income between 1967 and 
1984, and a doubling of the share of income coming from those sources. By 
1984, property income had become more important than earned income in the 
income packages of the elderly. Asset income from pensions other than Social 
Security (included in the “Other” category in Figure 3) was considerably less 
important than property income in 1984, although its share increased between 
1967 and 1987. 
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In sum, recent cohorts of elderly are much better off economically than are 
older cohorts. The improvement is due largely to rising Social Security 
benefits, combined with greater amounts of income from financial assets and 
private pensions. Increases in these sources have more than offset falling 
earnings, to produce a sharp improvement in the average economic position of 
the elderly. 


IS THE INCREASING AFFLUENCE OF THE ELDERLY 
FAIR? 


The improved economic status of the elderly contrasts sharply with the 
economic fortunes of other groups in the population, especially children. 
Between the late 1960s and mid-1980s the poverty rate for the elderly fell by 
half, to about 13%, while the rate for children rose by half, to about 20% 
(Smolensky et al 1988). The debate over the fairness of this turnabout in the 
well-being of America’s two largest dependent groups has raged throughout 
the 1980s, and it shows no sign of abating. 

A comprehensive review of both positive and normative issues in the 
debate over what constitutes an “equitable” distribution of income between 
generations is beyond the scope of this review article (see Palmer et al 1988 
and Kingson et al 1986 for a more complete discussion). We can, however, 
summarize some of the central facts and insights in the argument, and we 
devote considerable attention to equity aspects of Social Security—the largest 
and by far most important program that redistributes wealth between genera- 
tions. 

Smolensky et al (1988) review evidence from Decennial Censuses since 
1940 and show that the divergent economic paths of children and the elderly 
are a recent and brief anomaly in the longer-run trend of similar and sub- 
stantial gains for both groups. The divergence in the fortunes of the elderly 
and children first emerged in the early 1970s and has accelerated during the 
1980s. However, crucial changes in the nature of the economic dependence of 
both groups may mean that the recent past is indeed to a large extent 
predictive of the future. 

Smolensky et al (1988) observe ron until recently, the economic status of 
both children and the elderly was determined almost exclusively by the 
income of the working men and women with whom they resided. The 
explosion in the number of female-headed families and the growth of transfer 
programs such as Aid to. Families With Dependent Children have shifted the 
burden of supporting some children from the labor market to transfer pro- 
grams. For the elderly, an increasing propensity to live independently, apart 
from grown children, combined with earlier retirement and more generous 
Social Security benefits have almost completely severed their dependence on 
current earnings. 


AFFLUENCE OF THE ELDERLY 271 


The discrepancy in the economic situations of children and the elderly since 
the early 1970s can be accounted for, in large part, by the differential success 
of the labor market vs income transfers such as AFDC and Social Security 
benefits in keeping family incomes growing faster than inflation. The differ- 
ences are dramatic, with earnings and AFDC benefits falling in real terms, 
while Social Security benefits more than kept up with inflation. By 1980, the 
average Social Security benefit paid to retired workers and their wives 
amounted to 55% of median male earnings, 30% higher than the official 
poverty line for an elderly couple and nearly twice as high as the average 
AFDC benefit paid to a three-person family (Smolensky et al 1988, Table 
3.5). All of these figures are sharply higher than a decade before. Although 
some levelling off can be observed, similar trends are likely to persist. The 
key role played by the Social Security system in producing this divergence 
and the complications involved in assessing the system’s “equity” lead us to 
focus considerable attention on it. : 


Equity Aspects of Social Security Benefits 


Begun in the midst of the Great Depression as an old age retirement benefit 
program, the Social Security system has since expanded to include life and 
disability insurance coverage for nonelderly workers and, in 1965, the com- 
prehensive elderly health program, Medicare (Davis & Rowland 1986). The 
retirement program makes up the largest part of Social Security and is the 
focus of our discussion.” In 1974, benefits began to increase automatically 
with inflation—a key development, since the decade that followed saw 
unprecedented inflation that led prices to rise faster than wages. 

By 1988 the System’s expenditures totaled nearly $300 billion, most of 
which are financed by a flat rate tax on the annual earnings of workers up to a 
maximum level (currently around $40,000). Half of the tax is deducted from 
employee paychecks; the other half is paid, at least initially, by employers, 
but may well be borne in the long run by employees in the form of lower 
earnings (Brittain 1971). In 1990 the combined tax rate is scheduled to be 
12.4% of earnings, up from the 10.8% rate that prevailed in the early 1980s. 
Social Security retirement benefits are, for the most part, not treated as 
taxable income. Benefit levels are geared to career earnings although, as we 
shall see, they replace a much higher fraction of the career earnings of 
low-wage than high-wage workers. 


Pay-As-You-Go 
The Social Security system is currently financed on a “pay-as-you-go” basis, 
with payroll taxes collected each year paying the bulk of the expenditures 


"Equity aspects of financing Medicare medical benefits are considerably more complicated 
and not addressed here. 
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incurred in that same year. Some analysts (e.g. Pechman et al 1968) have 
argued that pay-as-you-go has a compelling theoretical justification analogous 
to that of a multitude of other current-period tax and transfer programs. In this 
view, revenue burdens on current workers ought to be evaluated according to 
standard criteria such as ability to pay; benefits ought to be allocated accord- 
ing to society’s judgments of need; and there need not be any linkages 
between the taxes paid by an individual during his working life and the 
benefits he receives in retirement. Thus, the pay-as-you-go system is a kind of 
social insurance in which each generation contributes to the care of the 
currently retired in return for a promise that the next generation of workers 
will do the same. Rather than explicitly linking past contributions and current 
benefits, pay-as-you-go decisions about benefit levels and payroll tax rates are 
based more on political than actuarial considerations. 

In a steady-state world in which all workers contribute taxes throughout 
their working lives, pay-as-you-go does produce a direct linkage between a 
given worker’s tax contributions and his retirement benefits, with each cohort 
receiving a return on their payroll tax “investment” equal to the sum of the 
population growth rate and the rate of growth of real wages (Aaron 1966). 
The real world is not as cooperative, since differential rates of inflation and 
real wage growth, different cohort sizes, and shorter-than-full-lifetime tax 
contributions cause large imbalances between system expenditures and costs. 
A short-run “crisis” in Social Security financing in the early 1980s was 
created by the combination of stagnant wages (and therefore payroll tax 
contributions) and inflation-indexed retirement benefits. A longer-run “crisis” 
was projected to loom in the twenty-first century when baby-boom retirees 
would threaten to drop the ratio of workers to beneficiaries from 3.3 in 1980 
to 2.1 in 2030, causing the system. to run large deficits.’ 

Important changes in the Social Security system were implemented in 1983 
and are instructive in the application of pay-as-you-go principles. Rather than 
turning outside the system to other revenue sources (such as the personal or 
corporate income tax), the commission recommended and Congress legislated 
a political compromise in which current beneficiaries bear some costs, pri- 
marily with a delay in cost-of-living adjustments and income taxation of the 
benefits of high-income beneficiaries. Current workers bear the bulk of the 
costs in the form of higher payroll tax rates and an expansion of earnings 
subject to taxation. Longer-run imbalances were “solved” in large part with an 
increase from 65 to 67 in the retirement age at which full benefits could be 
received, an increase to begin early in the twenty-first century. “Equity” in 
this compromise appeared to consist of spreading around the costs among 


®These ratios come from the “Intermediate (B) Projections” in the 1983 Trustees’ Report as 
reported in Thompson (1983) and are generally regarded as the most reasonable. 
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current beneficiaries and current workers (who are, of course, future benefi- 
ciaries), with due regard to the political power held by each group. 


The Pension Model 


An alternative way to view Social Security retirement benefits is as a public 
analogue to a private pension system. In order to make the issues clear, it is 
first helpful to describe the salient features of private pensions and then to 
examine Social Security in their light. 

In a private pension system, workers contribute (either explicitly or im- 
plicitly, in the form of lower earnings) to a company pension fund, which, in 
turn, is invested in stocks, bonds, and other financial instruments which 
accumulate dividends, capital gains, and interest. At the point of retirement, 
workers receive pension benefits either geared explicitly to their contributions 
plus accrued interest (as in a defined contribution plan) or as some fraction of 
their earnings in the years just prior to retirement (as in a defined benefit 
plan). 

In either case, but most clearly with defined contribution plans, an impor- 
tant feature of benefits is that the bulk of them constitute accrued interest 
rather than worker contributions. A person making contributions for 40 
working years and then receiving benefits over a 15-year retirement period 
will receive between one and six dollars in interest for every dollar in 
contribution, depending on the interest rate.? Thus, in a private pension 
system, workers “deserve” benefits that are much higher than their contribu- 
tions. 

Although President Roosevelt originally proposed the establishment of a 
fully funded pension fund for the retirement portion of the Social Security 
system, Congress initially legislated a system with only partial funding. 
Furthermore, when actuarial projections showed that the fund would accumu- 
late large reserves, Congress spent much of the reserves by expanding the set 
of beneficiaries to include individuals who would never make substantial 
contributions to the system but would otherwise have needed public transfers 
from income-tested programs that have some stigma attached to them. A fully 
funded system would in any case be utterly impractical under the current 
program, as it would lead to government ownership of trillions of dollars of 
American business stock, which is unnecessary given the power of the US 
government to tax to make good on its obligations. But while there is no 


°At a 3% interest rate, interest constitutes 58% of total benefits; at a 6% rate the comparable 
fraction is 83%. Unfortunately, using market interest rates in these calculations is misleading, 
because most of the resulting accumulation reflects inflation rather than inflation-adjusted real 
increases in purchasing power. It is safer to use interest rates of 2%~-3% in these calculations 
because they represent the historical difference between nominal interest rates and the rate of 
inflation. 


214 DUNCAN & SMITH 


Social Security pension fund analogous to private sector funds, one can still 
evaluate aspects of the Social Security system within the context of a pension 
model. 


The Pension Model’s Equity Rule 


Under the pension model of Social Security, workers contribute through 
payroll taxes during their working years, and then they and their spouses 
receive benefits throughout their retirement years. The equity principle for 
judging optimal benefits is straightforward: Each generation of beneficiaries 
is entitled to benefits that represent its payroll tax contribution plus accrued 
interest, where the interest accrual is similar to what would have been earned 
during the same period of accumulation by a private pension fund. Benefits in 
excess of what a private pension would pay constitute a transfer to the 
beneficiaries. 

There are several important qualifications and implications of the pension 
model’s equity rule. First, it applies to each generation of beneficiaries taken 
as a whole, and not necessarily to individual beneficiaries. As with private 
pensions (or any other insurance for that matter), there can be redistribution 
within generations of beneficiaries without violating the equity rule. In- 
dividuals who die early in retirement receive a much lower level of benefits 
than long-lived individuals—an insurance-based redistribution from the un- 
lucky to the lucky. 

Somewhat more controversial is redistribution within sence of Social 
Security beneficiaries owing to the much higher career benefits-to-carnings 
ratios for low- than high-wage workers and for married couples and single 
female earners than for single male earners.'° Some analysts argue for a 
Social Security model that includes such redistribution; others view Social 
Security as pure insurance and feel that there should be no such redistribu- 
tion. !! 


Rates of Return 


The pension model views the relationship between payroll contributions and 
benefits as a crucial criterion in evaluating the system’s equity. As with 
private pensions, calculations that show that the typical worker gets much 


Thompson (1983, Table 1) reports calculations from the Social Security Administration of 
benefit-to-earnings ratios for workers with uninsured spouses retiring at age 65 in 1983. Workers 
who worked half-time at minimum wage jobs throughout their careers would receive benefits 
equal to 1.35 times their earnings just prior to retirement. Comparable rates for full-time 
minimum wage workers, full-time workers with average earnings and workers at the maximum 
taxable earnings are .95, .69 and .39, respectively. 

. !Thompson’s (1983) review article gives a full set of references to the so-called “annuity- 
welfare” model, which has an explicit role for redistribution within generations, and the “in- - 
surance” model, which does not. 
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more out of the system than he puts in are irrelevant; workers are entitled to 
the large amount of interest earned—explicitly in private plans and implicitly 
in the Social Security system—on their contributions. 

More relevant are so-called “rate of ‘return” calculations, which show the 
implicit interest rate that equates inflation-adjusted benefits with contribu- 
tions. A 3% inflation-adjusted rate of return is often taken to be a useful 
dividing line between above- and below-market rates of return. Rates that 
differ by as little as one percentage point may seem similar, but in fact make a 
significant difference in the size of total benefits and costs. 

Boskin et al (1987) calculate rates of return for various cohorts of single- 
earner couples incorporating all of the changes. introduced in 1983 and 
reasonable assumptions about life expectancy and other parameters that de- 
termine benefits and costs. Shown in Figure 4 are their rate-of-return figures 
for high, medium, and low wage workers.'* Cohorts with rates of return in 
excess of 3% can be thought of as receiving both pension benefits and 
transfers. One can consequently use a 3% rate to discount tax contributions 
and benefits in order to allocate benefits into a portion that represents a 
pension payment and a portion representing a transfer to the beneficiary. 
Figure 4 also shows the resulting ratio of transfers to total benefits. 

Figure 4 shows both the dramatic decline—to below market levels—in 
rates of return for recent cohorts and the substantial redistribution from high- 
to low-wage workers within cohorts. Cohorts born in 1915, who turned 65 in 
1980, receive very high returns, with about half of their benefits representing 
a transfer. High- and medium-wage workers in cohorts born in 1930 will 
receive essentially no transfers and thus earn rates. of return close to the 3% 
rate, while medium- and high-wage workers in cohorts born in 1960 and 
beyond are scheduled to receive far less than would be warranted by a 3% 
return on their payroll contributions. Thus, the pension model would view 
current beneficiaries (i.e. the 1915 cohort) as receiving rather large transfers, 
with current workers facing the prospect of earning below-market rates of 
return on their payroll tax contributions. 


Balancing Revenues and Expenditures 


In contrast to the emphasis of the pay-as-you-go model on balancing current- 
period expenditures and revenues, there need not be any relationship in the 
pension model between expenditures and costs at any given point. The 
relative sizes of the population of current retirees and workers have no bearing 
on the level of retiree benefits, and the pension model views each generation 
of retirees as entitled only to a market return on its contributions. Implementa- 


High, medium and low age levels are defined as $50,000, $30,000 and $10,000, respective- 
ly, in 1985 dollars. These figures were chosen to correspond roughly to the earnings levels of 
well-paid professionals, median workers and poverty-line workers. 
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Figure 4 Social Security rates of return and ratio of transfer to total benefits for various cohorts 
of workers. (Source: Boskin et al 1987, Table 1A) 


tion of a pension-based benefit formula in a world of different cohort sizes 
may at times require the use of revenue sources outside the system, while at 
other times revenues will exceed expenditures and could either accumulate or, 
if prudent fiscal policy dictates, be spent for other purposes. 

The pension model offers a different view of the changes implemented in 
the 1983 reforms. Rather than helping to equalize the disparate rates of return 
between current and future generations of retirees, these changes will have the 
opposite effect. Changes such as delayed cost-of-living increases that affect 
current retirees lower the rather high rate of return enjoyed by current 
beneficiaries. But the most significant changes—in particular the jump in the 
payroll tax rate and the.increase in the retirement age from 65 to 67—-lower 
still further the rate of return to current generations of workers. 


Are Social Security Benefits Fair? 


Whether Social Security costs and benefits are equitably distributed, then, 
depends on the model under which one chooses to view them. The pension 
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model views current beneficiaries as receiving rates of return that are con- 
siderably higher than what the private market would pay, while future benefi- 
ciaries are likely to receive below-market rates. Recent “reforms,” con- 
strained by the level of pay-as-you-go payroll tax revenues, increased rather 
than decreased the disparities. 

The pay-as-you-go model views all benefits to current beneficiaries as a 
kind of transfer, while the pension model views only the higher-than-market- 
return portion of returns to current beneficiaries as representing transfers. 
Such transfers may be perfectly appropriate if.so judged by society, but, 
according to advocates of pay-as-you-go, they ought to be evaluated in the 
context of the whole set of possible societal transfer and tax decisions 
(Kingson et al 1986). Given the extremely broad-based support accorded to 
Social Security expenditurés, it is quite possible that current transfers to the 
elderly—even as defined by the pension model—do indeed reflect a societal 
consensus. !? : 


HEALTH STATUS AND THE ECONOMIC WELL-BEING 
OF THE ELDERLY 


The rising prosperity of the elderly has been accompanied by two important 
developments that may affect their financial standing: increasing longevity 
with possibly increasing frailty and mounting out-of-pocket health care ex- 
penses, particularly for nursing homes. If recent cohorts of elderly are either 
becoming more frail or incurring more medical costs, then measures of 
economic well-being based on income may overstate the true financial status 
of the elderly. To further determine whether or not the aged are indeed 
becoming better-off, we examine recent evidence on the relationship between 
their longevity, morbidity, and health expenditures. 


'3For example, the 1986 General Social Survey, taken after the 1983 reforms, showed that a 
majority (56%) of respondents thought that the country was spending “too little” on Social 
Security. If anything, younger respondents were more supportive of increased spending, which 
suggests that organizations lobbying on behalf of increased spending for the elderly are tapping a 
deep reservoir of support for transfers to the elderly among voters of all ages (Ponza et al 1988). 

It may seem that support for Social Security among the nonelderly runs contrary to self- 
interest. Of course, many of the nonelderly have elderly parents or other relatives who are current 
beneficiaries, and Social Security benefits may be a much smoother way of transferring money 
from adult children to their elderly parents. But it is also in the financial self-interest of many 
current workers to support the system. For example, the below-market rates of return shown for 
median earners in the 1945 birth cohort reflect calculations based on likely benefits and total 
payroll contributions. Boskin et al (1987) show that if one compares likely benefits with future 
payroll contributions, then the rate of return jumps from 2.1% to 4.5%. Thus, workers who 
consider past contributions to be “sunk costs” should view continuing the system as very much in 
their self-interest. 
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Increasing Longevity and the Elderly 


There has been a dramatic increase in life expectancy at birth during this 
century for both men and women (Chapman et al 1986). While these gains are 
partly the result of a decline in infant and childhood mortality, significant 
improvements in longevity have also been observed at all ages in adulthood. 
Perhaps most remarkable has been the significant and unexpected decline in 
mortality among the elderly. For example, in 1935, when the Social Security 
system was established, white men and women aged sixty could expect to live 
approximately fifteen more years (National Center for Health Statistics 1987). 
This figure had grown to eighteen years for white men and twenty-three years 
for white women by 1984. A consequence of increasing longevity has been a 
steady rise in the proportion of elderly in the population. Only 4.1% of the 
population was over age 65 in 1900, increasing to 11.6% in 1982; that 
proportion is expected to rise to 13% by 2000 (Spencer 1984). More impor- 
tant in terms of health, those over age 75 comprise a growing proportion of 
the elderly —by 2000 half of those over 65 will be older than 75, up from 30% 
just 50 years earlier. 


Models of Health Status When Life Expectancy Increases 


Predicting changes in average health status among the elderly during a period 
of increasing longevity is difficult because of the complex relationship be- 
tween mortality, morbidity, and disability. It is crucial to examine the dynam- 
ics between the onset of fatal and nonfatal illnesses, on the one hand, and the 
expected duration of the disability produced by these diseases. 

As the average life span increases among the aged, to what extent will their 
additional years be free of disease, and to what extent will the increase be due 
to a rise in the number of frail elderly, who otherwise would have died in the 
absence of life-saving advances in-medical technology and whose lives will be 
lengthened, but marked by poor health? Fries & Crapo (1981) and Manton & 
Soldo (1985) provide a useful framework for understanding the debate: over 
these two questions. They propose two “survival” curves, one representing 
the current mortality patterns and the other showing the time of the first 
significant chronic disease. The area between these two curves represents the 
cumulative morbidity years lived by the population between the onset of a 
chronic illness and death. 

What happens to cumulative morbidity years when life expectancy in- 
creases—i.e. when the survival curve for mortality becomes more “rectan- 
gularized”? Gruenberg (1977) argues that the age at onset of morbidity will 
not change because the health care system focuses not on prevention, but on 
the management of disease. This model predicts an extension of life marked 
by additional years of poor health. In other words, the healthy elderly go on to 
live more healthy years and they benefit least from the medical and lifestyle 
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changes that have produced declines in elderly mortality. The frail elderly, on 
the other hand, may now live longer because of these changes, but they do so 
in poorer health. This position predicts that an increase in life expectancy 
translates into an increase in cumulative morbidity years. That is, the survival 
curve for mortality becomes more rectangular, but the curve representing the 
first significant morbidity event remains unchanged, resulting in an increase 
in person-years of morbidity. z 

Fries (1980, 1983) has proposed an alternative prediction based on a theory 
of the compression of morbidity. He contends that the elderly may experience 
a postponement of the age at which chronic disease symptoms first appear and 
suggests that the age at which the first significant onset of disability occurs 
will rise more quickly than will life expectancy. He argues for the existence of 
a “natural death” without disease, because all major organs decline in 
functioning from birth to death simply as a result of senescence, even without 
any specific pathology. This means that as life expectancy increases, a 
person’s age at the time of the first significant chronic illness increases at an 
even greater rate and cumulative morbidity years decline. Thus, morbidity 
should become increasingly concentrated in the years just before death. 

Manton (1982) suggests that a type of “dynamic equilibrium” exists which 
affects the duration of morbidity associated with chronic illnesses. He theo- 
rizes that the average duration of a chronic disease may increase in-one of two 
ways. First, the rate of progression of the disease may be slowed, which 
would increase longevity and the prevalence of disability but reduce the 
severity of the disease. Alternatively, the rate of progression of the disease 
may remain unaltered but the lethal complications of the disease (e.g. 
pneumonia), are eliminated. Manton feels that a large share of the gains in life 
expectancy have resulted from the first mechanism, but it is difficult to predict 
what effects these changes will have on the societal costs of caring for the 
elderly. He suggests that medical costs will rise in order to reduce the severity 
of the chronic diseases and, accordingly, to lower the mortality rate. Costs 
might also increase because the prevalence rate of many chronic conditions 
will naturally rise. However, these health care expenditures may be offset by 
gains in the economic productivity of the elderly and increases in the length of 
their working lives. 


Empirical Evidence on the Relationship Between Longer Life 
and Health Status Among the Elderly 


The problem with assessing the changing patterns of morbidity during periods 
of increasing life span is that no systematic data are available that adequately 
measure the timing of the onset of chronic illnesses relative to the timing of 
death. The issue is whether the elderly are becoming more or less healthy, on 
average. Manton & Soldo (1985) question the idea that survival curves are 
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becoming more rectangularized because of reduced mortality at older ages. If 
the elderly live longer while enjoying proportionately fewer years with 
chronic illnesses, then one would expect the average age at death to rise, but 
also the distribution of ages at death to decline over time. In other words, as 
the elderly approach the natural limits of the life span, there should be an 
increasing concentration of the timing of death. However, Myers & Manton 
(1984) and Manton & Soldo (1985) report that while male and female elderly 
have enjoyed longer lives between 1960 and 1980, this change has not been 
accompanied by a greater concentration of ages at death. 

In order to assess the changing living standards among the elderly and their 
sensitivity to changes in health conditions, we concentrate on the con- 
sequences of changing morbidity patterns rather than on specific diseases. 
The National Health Interview Survey (NHIS) provides valuable information 
for tracking the changing health status of the elderly. The NHIS is based on 
interviews with a national sample of households among the noninstitutional- 
ized population and has been conducted continuously since 1957. 

Figure 5 shows changes in health status for men and women over age 65 
during the past two decades. Two different health status measures are shown: 
annual restricted activity days due to acute conditions, and annual bed days 
resulting from acute conditions. Again, these figures are based on the nonin- 
stitutionalized elderly population. 

Restricted activity days are the most inclusive measure of disability because 
they include days spent in bed, missed days from work or school, and days 
when a person limits the things he or she usually does (i.e. cut-down days). 
There is no clear trend in restricted activity days for either men or women, 
although the younger noninstitutionalized elderly (ages 65-74) did experience 
a rise during the 1970s. Bed disability days better represent more severe 
short-term disability than do work-loss, school-loss, or cutdown days. Figure 
5 shows again that there are no clear improvements nor deteriorations in 
health among the elderly. 

Trends are also stable as regards limitations in major activity (e.g. work, 
school, housework) due to chronic conditions, and they do not correspond to 
declines in mortality rates during this time period. For men, no clear trend can 
be detected with the possible exception of a decline in limitations during the 
1980s (following significant changes in the structure of the NHIS question- 
naire in 1982). For elderly women, the pattern is one of relatively modest 
increases coupled with decreases in disability, with no apparent trend except, 
again, for a modest decline during the 1980s. 

Verbrugge (1984) also used the NHIS to track the health status of middle- 
aged and older individuals; however, she had no data available to her from the 
1980s and did not distinguish between younger and older elderly. She con- 
cluded that there has been some decline in health during the 1970s, and that 
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Figure 5 Changes in annual restricted activity days among the elderly, 1961-1980. (Source: 
Adapted from Poterba & Summers 1987, Table 9) 


restricted activity days for all conditions have increased gradually for those 
over 65. This occurred most significantly during the 1970s, the same period 
that witnessed declining heart disease mortality. Trends shown in Figure 5 
indicate that total restricted activity days are gradually rising only for the 
younger elderly. Verbrugge’s conclusion that there is increasing disability 
among the elderly may therefore apply only to this subgroup. The percentages 
of those limited in their major or secondary activities (either in the labor 
market or the home) have also slightly increased for all elderly (65 years or 
older). However, Baily (1987) finds that the percentage of elderly men and 
women who were unable to perform their major activity (a more stringent 
measure of disability than “any” limitations in major activity) between 1960 
and 1980 was relatively constant; any increases were largely attributable to 
changes in the design of the NHIS survey. : 
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An examination of disability rates among the noninstitutionalized elderly 
yields only a partial view of health trends because those in poorest health are 
not surveyed (Brody et al 1987). Using the 1982 National Long-Term Care 
Survey (NLTCS) of noninstutitionalized persons over age 65 conducted by 
the Health Care Financing Administration (HCFA), Manton & Soldo (1985) 
find that disability rates among the elderly remain relatively stable as one 
moves from one age category to the next. This is largely because the most 
severely disabled persons, who are unable to live in the community, have 
been selected out of the surveyed population (i.e. NHIS) as a result of 
mortality or institutionalization. 

It is not yet clear whether more recent cohorts of elderly, with their higher 
rates of survival, will show higher rates of disability among the noninstitu- 
tionalized (i.e. they will survive in greater numbers to become disabled but 
remain healthy enough to avoid institutionalization) or will have higher rates 
of institutionalization (i.e. they will survive in greater numbers only to 
succumb to serious disabilities later in life). Figure 6 shows that since 1963 a 
small rise has occurred in the annual percentage of younger elderly (ages 
75-84) living in nursing homes, although a small decline is observed in 1982. 
However, the oldest old (85 or older) show the most substantial and sustained 
decline in the percentage living in nursing homes since the early 1970s. 
Poterba & Summers (1987) argue that this may indicate the improving health 
of the oldest old relative to younger elderly. They assume that non-health 
factors, such as the availability of financial assistance and home health care, 
operate equally among all elderly who are in need of nursing home care. 
Therefore, they argue that the recent drop in nursing home residency rates 
among the oldest old is likely to be the result of improving health status. 

Soldo and Manton (1985) and Doty (1986) expect institutionalization rates 
for the elderly to increase because of reduced opportunities for alternative 
home care. For example, elderly parents may receive less care from their 
adult children since those children, elderly themselves, have health concerns 
of their own to contend with. 

According to Poterba & Summers (1987), the relatively stable measures of 
morbidity over the past 20 years indicate that general improvements in 
morbidity are offset by higher rates of morbidity among “marginal survivors” 
who are living longer lives. While they conclude that longer life does not 
necessarily mean worsening health, they imply that current and possibly 
future cohorts of elderly will endure the same rates of morbidity but for more 
years. They adopt a position somewhere between that of Fries (1983) and 
Gruenberg (1977): on average, the elderly are living longer, but their patterns 
of disability remain the same. 

Overall, it appears that the elderly have not benefitted from the improve- 
ment in health predicted by Fries (1983). Instead, the noninstitutionalized 
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PERCENTAGE LIVING IN NURSING HOMES 





YEAR 


Figure 6 Proportion of elderly living in nursing homes, 1963-1982. Source: Adapted from 
Poterba & Summers -1987, Table 7) l i 


elderly are experiencing steady rates of morbidity and disability. Disability 
among the elderly in the community has remained relatively constant, both 


when rates of institutionalization were increasing and now when they are 
decreasing. 


Changes in Morbidity and the Economic Status of the Elderly 


Medical costs associated with care of the elderly pose a threat to recent 
economic gains made by this group. Before we discuss the expenses incurred 
directly by the elderly, it is important to describe briefly the primary sources 
of funding for medical care for the elderly: Medicare and Medicaid (see Davis 
& Rowland’ 1986 and Gornick et al 1985 for a full discussion of these 
programs). Medicare went into effect during July 1966 and was designed to 
reduce financial burdens and therefore increase access to medical care for the 
elderly, the age group with the highest rates of illness and the lowest income. 
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Medicare has two components: hospital insurance (Part A) and supplementary 
medical insurance (Part B). Part A covers the majority of hospital costs 
associated with acute inpatient hospital care as well as a fixed number of days 
in skilled nursing facilities. In general, the beneficiary must pay a small 
out-of-pocket inpatient hospital deductible, which rose from $40 in 1967 to 
$400 in 1985. To avoid further deductibles and coinsurance stipulations, 
many elderly purchase so-called Medigap health policies to cover additional 
expenses (Rice 1987). The Part A trust fund relies on revenues generated from 
a payroll tax of current workers. Part B is designed to provide payments for 
physician expenses and physician-ordered services and supplies. The in- 
dividual must pay a monthly premium ($3 in 1967, $15.50 in 1985) as well as 
a yearly premium. For each accepted claim for payment for physician ser- 
vices—and not all claims are granted—the Medicare beneficiary must pay 
20% of the allowed expenses. Part B is funded by enrollee premiums as well 
as general tax revenues. 

Medicaid covers many medical expenses incurred by the poor and near- 
poor elderly that are not covered by Medicare. Perhaps one of its most 
important functions is to provide coverage for nursing home expenses. In 
order to become eligible, individuals must “spend down” their savings and 
assets to satisfy the program’s income and asset restrictions (i.e. in 1988, an 
individual is eligible if he has less than $1900 in income and assets). 

While both Medicare and Medicaid will continue to undergo changes that 
will directly affect the economic status of the elderly (e.g. greater contribu- 
tions by the elderly; see Davis & Rowland, 1986), there is no question that 
this group will continue to show the highest health care utilization rates and, 
correspondingly, health care costs. Recent trends indicate that the differential 
in this respect between the elderly and younger cohorts is growing (Lubitz & 
Deacon 1982). This is in keeping with the higher rates of self-reported 
disability among the elderly, relative to the rest of the adult population. Davis 
& Rowland (1986) note that the rate of hospitalizations is growing faster for 
those over 65 than those under 65.'4 

Are the elderly seriously threatened by the potentially catastrophic costs of 
health care, or are they protected by public and private medical insurance 
programs? Gornick et al (1985) and Waldo & Lazenby (1984) show that the 
percentage of all medical costs paid out-of-pocket declined among the elderly 
from 53.2% in 1966 to 29.3% in 1977, then to 25.2% in 1984. Correspond- 
ingly, the portion of medical expenses paid by Medicare rose from zero to 
44.1 to 48.8%. 

Out-of-pocket medical expenses among the elderly have continued to be a 


141n 1967, those under 65 had fewer than 130 admissions per 1000 population and this figure 
increased to only 135 admissions per 1000 by 1979. The comparable figures for those over age 65 
are 260 and 350 in 1967 and 1979, respectively. 
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small portion of their personal income. The US Senate’s Special Committee 
on Aging (cited in Gornick et al 1985) estimated that the elderly spent 
approximately 15% of their person income on out-of-pocket medical expenses 
prior to Medicare. While this figure dropped to 12% in 1977, it was back to 
15% in 1984. However, because of accelerating inflation in medical care 
costs, the proportion of out-of-pocket health care expenditures paid by the 
elderly is expected to rise to 18% in 1989 (Rabin & Stockton 1987). 


Impoverishment and Nursing Home Costs 


There is little doubt that many medical expenses incurred by the elderly for 
acute conditions have been covered by Medicare. In spite of these benefits, 
many disabled elderly remain economically vulnerable to the high costs of 
nursing home care, most of which are paid by the elderly themselves and their 
families. What are the financial implications for an elderly person and his 
family when he enters a nursing home? 

Medicaid coverage has reduced the institutionalization costs borne by the 
individual from their pre-Medicaid levels. For example, two years prior to the 
initiation of Medicaid, nearly two thirds of all nursing home care expenditures 
were covered by private payments (Rabin & Stockton 1987). This percentage 
dropped to roughly 50% in 1967, the yéar after the Medicaid program began, 
and remained at that level throughout the 1970s. By 1984, Waldo & Lazenby 
(1984) find that only 3% of hospital expenses and 25% of physician costs 
were paid out-of-pocket; in sharp contrast, half of nursing home expenses 
were paid directly by the elderly. 

A recent Congressional report found that nearly half of the elderly whose 
nursing home expenses were covered by Medicaid became eligible by spend- 
ing down their income and assets after they entered the nursing home (US 
House of Representatives 1987). The same report estimated that in the 1980s, 
“after only 13 weeks in a nursing home, 7 in ten elderly persons living alone 
find their income spent down to the federal poverty level” (p. 6). Nearly half 
of elderly couples living alone are impoverished by the end of one year in a 
nursing home. Given that large portions of nursing home care are paid 
out-of-pocket, and 40% of those 65—69 will enter a nursing home at some 
time during their lifetime (US House of Representatives 1987), there will be a 
significant financial impact resulting from institutionalization among the 
growing ranks of the elderly. 

Comparable statistics are not available for earlier pre- and post-Medicaid 
years, but Davis & Rowland (1986) report figures that point to worsening 


'SGornick et al (1985) report that the growth in costs for hospital inpatient care between 1973 
& 1983 can be accounted for by five factors: overall inflation, 51%; increase in the size of the 
covered population, 7%; excess inflation in hospital input prices, 15%; higher admissions per 
capita, 4%; and more services used per hospital admission, 23%. 
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financial consequences of institutionalization. They find that between 1972 
and 1981, the number of elderly Medicaid beneficiaries increased slightly 
from 3.3 million in 1972.to 3.7 million in 1974, falling slowly to 3.5 million 
by 1981. However, the annual expenditure per beneficiary increased from 
$580 in 1972 to nearly $2800 in 1981. The trends in the number of beneficia- 
ries reflect the fact that there are proportionately fewer elderly poor who are 
eligible for Medicaid coverage, although the number of elderly is growing. 
But for those who enter nursing homes, the financial burden to both the 
Medicaid system and the elderly themselves has risen dramatically. Recall 
that the Medicaid expenditures for the elderly represent costs incurred to the 
system after the elderly have spent down their savings and assets. 

Unfortunately, no long-term individual-level cohort data exist to shed light 
on how it happens that elderly living in the community become in- 
stitutionalized, or how they cover the associated expenses. However, Rivlin 
& Wiener (1988) report simulated financial consequences of institutionaliza- 
tion based on current and projected utilization and expenditure patterns among 
the elderly. Given current levels of public support for nursing home care and 
prevailing medical and demographic trends, Rivlin & Wiener project that the 
risk of impoverishment will rise in the coming decades for the in- 
stitutionalized elderly. Out-of-pocket nursing home costs for those 65 and 
older will average (in constant 1987 dollars) nearly $30,000 for the entire stay 
in 2016-2020, with 17% of nursing home residents paying over $50,000. 
They estimate that of all nursing home expenses, the proportion paid out-of- 
pocket will nearly double between 1988 and 2020. Total Medicaid ex- 
penditures for nursing homes are expected to more than double during the 
same period. 


SUMMARY 


In purely economic terms, the typical elderly person today enjoys a standard 
of living that is much higher than at any time in the past, and perhaps higher 
than that of the average nonelderly person. While part of the improvement 
stems from a greater accumulation of wealth in the form of owned homes, 
financial assets, and private pensions, much of it is due to more generous 
Social Security benefits. When viewed as a pension, Social Security pay- 
ments to current retirees provide a much higher rate of return on past payroll 
contributions than private pensions would generally have paid. Soon, howev- 
er, this situation will be reversed: under current rules, the typical worker 
retiring after the year 2000 will receive benefits that reflect a lower-than- 
market rate of return, and the rate of return will continue to drop for new 
retirees throughout the first quarter of the twenty-first century. 

Whether or not one concludes that current retirees are unfairly advantaged 
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and future retirees unfairly disadvantaged by the structure of Social Security 
depends on the model used to assess system equity. Within the context of a 
pension model, high benefits to current retirees represent a huge income 
transfer from current workers to current retirees. Given the overwhelming 
support accorded Social Security expenditures by both elderly and nonelderly, 
the transfer may well reflect a societal consensus that the elderly constitute the 
group most deserving of higher transfers, perhaps in part because Social 
Security benefits substitute for the more problematic transfers from adult 
children to elderly parents that would otherwise have to be made. However, 
the pension model also views the higher payroll taxes on current workers used 
to finance the transfer as increasing society’s obligation for future benefits 
when current workers retire. Current projections indicate that these obliga- 
tions will not be met. l 

Although the status of the average elderly person appears to be quite 
comfortable, many subgroups of elderly enjoy little economic security. 
Poverty rates are quite high among female and minority elderly, and the 
surprising turnover witnessed among the elderly poor means that larger 
numbers are at risk of at least occasional poverty than single-year figures 
would suggest. Economic status typically falls substantially after retirement, 
following a dynamic that is not well understood. 

Fortunately, the elderly in the United States are enjoying longer lives 
without an apparent decline in health. For those elderly requiring medical 
care, access to hospital and physician care has been enhanced by the advent of 
Medicare and Medicaid. These programs have allowed many elderly to avoid 
large out-of-pocket expenses for acute care. However, as the population 
continues to age, the elderly may be required to pay more of their medical 
expenses directly, a change that currently is the subject of heated debate. Part 
of this change is driven by a decline in the view that children should be 
financially responsible for their elderly parents (Crystal 1982). The more 
immediate factor that places the elderly in increased economic jeopardy is the 
risk of institutionalization. Many elderly live in the community free of 
disabling health conditions, but those who must enter nursing homes run a 
substantial risk of losing any gains they may have made during their later 
years. 
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Abstract 


The topic of self-efficacy is part of a broad literature which has developed 
around the issues of human agency, mastery, and control. Its more delimited 
focus is on perceptions and assessments of self with regard to competence, 
effectiveness, and causal agency. Self-efficacy has become an important 
variable within social psychological research because of its association with 
various favorable consequences, especially in the areas of physical and mental 
health. It is also quite congruent with the Western emphasis on such values as 
mastery, self-reliance, and achievement. This review examines the nature of 
self-efficacy and related terms, reviews the research literature on the develop- 
ment of self-efficacy and how social structure and group processes affect this 
development, considers changes of self-efficacy over the life course, and 
reviews the consequences of self-efficacy for individual functioning and for 
social change. The focus of the review is on the social psychological literature 
within sociology, psychology, and to some extent political science. 


Introduction 


Interest in self-efficacy is both new and old. It has recently emerged as a 
major theme within social psychology, partly because of the renewed interest 
in self phenomena (Gecas 1982). But self-efficacy has also been around in 
one form or another for a very long time, embedded in such perennial 
philosophical concerns as the determinism and indeterminism of human 
action, voluntarism and intentionality, free will and causality, and other 
issues of human agency. These issues are particularly important to the social 
sciences where they have often served as axes for differentiating competing, 
and sometimes antagonistic, theoretical perspectives (e.g., “humanism” vs 
“positivism”). Within the larger topic of human agency, self-efficacy occu- 
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pies a much smaller and more delimited sphere. It refers to people’s assess- 
ments of their effectiveness, competence, and causal agency. But even within 
this more delimited sphere, there are many shades of meaning contained 
within self-efficacy and various closely related terms (as we shall see). While 
not completely ignoring the broader philosophical and theoretical perspectives 
on self-efficacy, the main focus of this review is on contemporary social 
psychological research and theory on self-efficacy and related concepts, 
mostly found in sociological, psychological, and, to some extent, political 
science literatures. | 


The Many Faces of Self-Efficacy 


Much of the contemporary work on self-efficacy is found in the psychological 
branch of social psychology. Two lines of development have emerged (dis- 
tinct from each other, but with occasional overlaps): (a) motivational theories, 
which conceptualize self-efficacy in motivational terms; and (b) cognitive 
theories, which conceptualize self-efficacy in terms of expectancies and 
perceptions of control. 

Motivation theories tend to emphasize the experience of causal agency and 
to see it as a fundamental human need and a basic element in one’s sense of 
self. White’s (1959) influential theory of “effectance motivation” was an early 
statement of this position. Effectance motivation was conceptualized as an 
intrinsic motivation to produce effects on the environment, that is, to make 
things happen. It was characterized by White as exploratory, creative, and 
playful activity. As such, it was also considered to be pleasurable, engaged in 
for its own sake. This theme of enjoyment—“the joy of being a cause” or “the 
pleasure of mastering challenges”—~is not uncommon to motivation theories 
of self-efficacy (see Reeve et al 1986, and Csikszentmihalyi 1975). A number 
of scholars have built on White’s formulations. Harter (1978), using the term 
“competence motivation,” distinguished between two types: one is a striving 
to master challenges; and the other is a striving to encounter challenges 
(White placed more emphasis on the latter than on the former). Harter also 
investigated developmental changes in the content of competence motivation. 
Adler’s (1927) concept of “mastery motivation” emphasized the striving for 
competence in dealing with one’s world, as does Smith’s (1968) discussion of 
the “competent self,” Foote & Cottrell’s (1955) concept of “interpersonal 
competence,” Deci’s (1975) discussion of intrinsic motivation in terms of the 
needs for “self-determination” and “competence,” and to some extent the 
literature on achievement motivation (Atkinson & Raynor 1975). 

Emphasis on “control” rather than “competence” is evident in some of the 
motivation theories of self-efficacy. As with competence, the stress in these 
theories is on the experience of control and the motivation to be in control, 
which distinguishes motivational theories from the orientation to control 
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found in cognitive theories of self-efficacy. In discussing his concept of 
“personal causation,” DeCharms (1979;31) makes a clear distinction between 
these two orientations toward “control”: “The concept of personal causation 
and the concept of locus of control appear to be based on two distinct models. 
Personal causation attempts to tap the experience of controlling and being 
controlled. Locus of control is more in the ‘perceived control’ tradition.” And 
again: “We do not see control; we feel it. . . . The experience of control is 
derived far more from touch and kinesthesis than from vision” (1979:31). 
Curiously, DeCharms’ (1968) earlier conceptualization of personal causation, 
and his distinction between “origins” and “pawns,” did emphasize perception 
(as in “the perceived locus of causality of behavior”), since it was grounded in 
attribution theory. But he has clearly departed from attribution theory and the 
perceptual emphasis in order to stress the experience of agency. The motivat- 
ing experience of control is also evident in McClelland’s (1975) “power 
motive,” the motivation to control others, which derives more from psy- 
chodynamic than from attribution theory. A somewhat different angle on 
control is found in Brehm’s (1966) “psychological reactance theory.” Psycho- 
logical reactance refers to the motivation to seek freedom from constraints. 
According to Brehm, individuals become motivationally aroused when others 
try to control their behavior, that is, when their freedom is threatened or 
constrained. 

Cognitive theories of self-efficacy, based largely on attribution and social 
learning theories, place more emphasis on beliefs and perceptions of causal- 
ity, agency, or control and less on the motivations to hold such beliefs. This is 
a matter of relative emphasis, however, since these beliefs do have motiva- 
tional implications. Inherent in the formulations of attribution theory is the 
assumption that causal attributions are made in order to make the social world 
predictable and controllable (Pittman & Heller 1987). Kelley (1971:22) states 
that people’s motivation to maintain control has a pervasive influence on their 
attributions of causality: “The purpose of causal analysis—the function it 
serves for the species and the individual—is effective control.” 

Particularly relevant here are the self-attributions individuals make with 
regard to personal control over events that affect them. In an influential 
article, Rotter (1966) distinguished between “internal” and “external” locus of 
control, as generalized expectancies that individuals develop in relation to 
their environment. This distinction spawned numerous studies on the con- 
sequences of being “internal” vs “external” in causal attributions (see Wort- 
man 1976, Lefcourt 1976 for reviews). Refinements of these concepts began 
to appear. An important distinction made by Gurin et al (1978) is between 
“personal control” (one’s perceived sense of control) and “control ideology” 
(one’s judgment of how much control people in general have over their 
situations). In most of the research on the consequences of causal attributions, 
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it is “personal control” rather than “control ideology” that is the more con- 
sequential for individual behavior (some of this research is reviewed later). 
Seligman’s (1975) concept of “learned helplessness” deals with the negative 
consequences of lack of personal control. Learned helplessness refers to a 
chronic sense of inefficacy resulting from learning that one’s actions have no 
effect on one’s environment. Seligman and his colleagues (Abramson et al 
1978) argue that depression associated with learned helplessness is likely to 
occur when the individual attributes his or her inefficacy to personal failure 
rather than to universal conditions. 

Bandura (1977, 1981, 1986), in developing his “self-efficacy theory,” adds 
another refinement. From the perspective of social learning theory, centered 
on self-evaluation processes, Bandura distinguishes between efficacy ex- 
pectations and outcome expectations. An efficacy expectation is a belief that 
one can successfully perform a particular action. It is a judgment of one’s 
personal efficacy. An outcome expectation is an estimate that a given action 
will lead to a certain outcome (Bandura 1977:193). The former is a belief 
about one’s competence, the latter is a belief about one’s environment. The 
importance of this distinction is that feelings of futility may result either from 
(a) low self-efficacy or (b) perceptions of a social system as unresponsive to 
one’s actions. “To alter efficacy-based futility requires development of com- 
petencies and expectations of personal effectiveness. By contrast, to change 
outcome-based futility necessitates changes in prevailing environmental con- 
tingencies that restore the instrumental value of the expectancies that people 
already possess.” (Bandura 1977:205). Thus, Bandura differentiates per- 
ceptions of self from perceptions of self in relation to the social environ- 
ment-—a distinction that is important to traditional sociological and political 
science concerns. 

Bandura’s distinction between “efficacy expectations” and: “outcome éx- 
pectations” has not gone -without criticism. Eastman & Marzillier (1984) 
criticize Bandura for the ambiguity of the distinction, arguing that the way in 
which these terms are defined makes it difficult to treat outcome expectations 
separately from efficacy expectations (also see Bandura 1984 for a response). 
They do have a point. But the problem, I think, is with the label “outcome © 
expectation” and not with the underlying idea. A comparable but less 
ambiguous concept is “system responsiveness,” found mostly in the sociolog- 
ical and political science literatures on political activism (discussed below). 

For that matter, similar distinctions have been around longer in the political 
sociology and political science literatures, and they are more likely to be 
utilized in research (Bandura’s research, as well as that of others who have 
relied on his self-efficacy theory, has focused on efficacy expectations and 
has neglected outcome expectations). Gamson (1968), a political sociologist, 
distinguished “competence” (i.e. perceptions of one’s ability to influence 
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political systems) from “trust” in political institutions (i.e. perceptions that 
political institutions and authorities do what they are supposed to do). Almond 
& Verba (1965) and Barber (1983) distinguish between self-perceived com- 
petence and “system’s responsiveness.” Working within the framework of 
organizational psychology, Porter & Lawler (1968) distinguish between “ex- 
pectancy” (i.e. efficacy expectation), the belief that effort will lead to per- 
formance, and “instrumentality” (i.e. outcome expectation), the belief that a 
certain level of performance will lead to rewards in the organization. Lane 
(1962:149), a political scientist, observed that political efficacy “has two 
components—the image of the self and the image of democratic gov- 
ermment—and contains the tacit assumption that an image of self as effective 
is intimately related to the image of democratic government as responsive to 
the people.” Similarly, Gurin & Brim (1984) in assessing changes in political 
beliefs distinguish between “personal efficacy” and “system responsiveness.” 
All of these scholars of sociological and political science persuasion, along 
with Bandura, argue for the importance.of distinguishing between the self and 
environmental components of efficacy. However, they tend to differ from 
Bandura, not only in the terms used but also in conceptualizing these ex- 
pectancies as generalized expectancies about self and social system, rather 
than as expectancies quite specific to a particular task or situation. The 
importance of distinguishing between these two components of sense of 
control is more apparent when we consider the relevance of self-efficacy for 
social change. | 

Sociology has contributed several other “faces” or perspectives on self- 
efficacy.! The literature on alienation, based on Marx’s work (1844), is the 
most directly relevant. The central theme in Marx’s view of human nature was 
that of self-creation through efficacious action in the context of work or 
“praxis.” The self is created and affirmed through the individual’s work 
activity. Under conditions of alienation, however, the sense of self-efficacy is 
frustrated or inhibited because of a disjuncture between action and self, 
wherein work activity is no longer a reflection and affirmation of self. At the 
heart of the problem of alienation for Marx was the issue of control, that is, 
' the extent to which the individual has control over his/her labor and its 


‘There is a long and rich tradition in sociology which deals with voluntaristic action. Much of 
Weber’s work on the cultural and value bases of action; Parsons’ (1937) attempt to develop a 
voluntaristic theory of action; much of symbolic interactionism of the Blumer (1969) variety 
which deals with action as a construction based on goals, meanings, purposes, and other “future 
states,” and a good deal of contemporary exchange theory which focuses on choice and 
decision-making are part of this broad sociological concern with action and agency. An indication 
of the importance of voluntarism and agency in sociological thinking was the strong reaction to 
Wrong’s (1961) accusation that contemporary sociology has an “oversocialized conception of 
man.” Wrong’s article seemed to hit a raw nerve. 
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products. If the locus of control is outside the individual (e.g. in the “boss,” 
the “company,” the “machine,” etc.), he or she exists in a condition of 
alienation. To a large extent, this was the basis of Marx’s critique of capital- 
ism. A key feature of alienation, therefore, is powerlessness (see Seeman 
1975, 1985 for discussions of the contemporary literature on this theme). 
There are some obvious similarities between the Marxian perspective on 
powerlessness and control and some of the psychological literature on control 
discussed above [cf Lacey’s (1979) discussion of “agenda control”]. Marx, of 
course (and most contemporary sociologists), placed much more emphasis on 
the social structural context for the emergence and maintenance of power, 
powerlessness, and control. 

Self-theory within sociology, which is largely synonymous with symbolic 
interaction theory, provides another perspective on self-efficacy. Symbolic 
interactionism, following Mead’s (1934) formulations, has strongly empha- 
sized an active and creative view of the self. Grounded in the philosophy of 
pragmatism, with its emphasis on action and its consequences, the individual 
is viewed primarily as an actor in the environment, shaping and creating 
his/her world as well as being created by it. The concept of “T” in the reflexive 
self of Mead is the source of such agentive qualities as spontaneity and 
creativity. The active, efficacious self is also quite evident in more contempo- 
rary symbolic interactionist writings. It is apparent, for example, in Goff- 
man’s (1959) work on impression management as interpersonal control, 
Weinstein’s (1969) work on altercasting and interpersonal competence, and 
much of the work on identity negotiations (Blumer 1969, Strauss 1978). 

While the general theme of action and agency is quite pronounced within 
this sociological tradition (see Gecas 1982, 1986 for more extensive treat- 
ments of this perspective and its relevance to self-theory), the topic of 
self-efficacy is rarely treated explicitly. But it does surface occasionally. In 
studying sources of authenticity and inauthenticity for individuals, Turner 
(1976:991) observes that behaviors thought to reveal the “true self” are also 
ones whose causes are perceived as residing in the person rather than the 
situation. This is particularly evident when moral issues are at stake and 
attributions of responsibility are made to the self (Backman 1985, Schwalbe 
1988). Goffman (1961:43) provides a critical analysis, with moral im- 
plications, of the effects of total institutions on the personal efficacy of 
inmates: 


Total institutions disrupt or defile precisely those actions that in civil society have the role 
of attesting to the actor and those in his presence that he has some command over his 
world—that he is a person with ‘adult’ self-determination, autonomy, and freedom of 
action. A failure to retain this kind of executive competency, or at least the symbols of it, 
can produce in the inmate the terror of feeling radically demoted in the age-grading system. 
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Sykes (1958) makes a similar point in his study of prisoners. 

Most sociological work on self-efficacy is grounded in either the Marxian 
tradition or the symbolic interactionist perspective, or both. Rosenberg’s 
(1979:31-—33) discussion of “self-confidence” as a dimension of self-concept 
is based largely on a symbolic interactionist perspective. Also drawing on 
symbolic interactionism, Franks & Marolla (1976) distinguish between “inner 
self-esteem,” which is based on efficacious action of the individual and “outer 
self-esteem,” based on social approval. Gecas & Schwalbe (1983) extend this 
discussion of efficacy-based self-esteem by linking it to social structure and 
utilizing both Marxian and interactionist perspectives. Giddens (1979:88) also 
utilizes both perspectives, plus his own brand of structuralism, in his ‘con- 
ceptualization of power as the “transformative capacity of social actors,” 
which refers to actors’ capacities to achieve intended outcomes.’ Research on 
self-efficacy from these sociological perspectives is reviewed mostly in the 
sections dealing with social structural and cultural influences, whereas psy- 
chological research is most evident in the sections on consequences of 
self-efficacy and on its early development. But first, a note on measure- 
ment. 

There are several different approaches to the measurement of self-efficacy. 
Generally, self-efficacy measures fall into one of the following categories: 
task-specific measures (such as Bandura’s approach which develops measures 
specific to the task or competency at hand and makes no claim to generality of 
efficacy beliefs beyond that task); domain-specific measures [e.g. Wallston et 
al (1976); Health Locus of Control Scale, which measures control orientations 
specific to the health domain; and Lane’s (1959), political efficacy measure]; 
and general measures [e.g. Pearlin’s (1983) mastery scale; Gecas’ (1971) 
self-efficacy scale; and Sherer et al’s (1982) self-efficacy scale, which mea- 
sure people’s general sense of personal efficacy or mastery]. These measures 
vary in format from multiple-item semantic differentials (Gecas 1971) to 
Likert-type scales of “agree/disagree” set to statements indicating efficacy or 
inefficacy (Pearlin 1983, Sherer et al 1982, Wallston et al 1976), to single- 
item indicators of the degree of certitude individuals have in their ability to 
perform a particular task (Bandura et al 1980). Each of these approaches to 
the measurement of self-efficacy is justifiable and has merit, depending on 
one’s research objectives. 

In general, however, the measurement of self-efficacy is still rather primi- 
tive, even by social psychological standards. Compared to other psycholog- 
ical constructs, such as self-esteem, instrument development has not been 
very extensive. This is due mainly to the relative recency of self-efficacy in 
social psychological research. Especially needed is more extensive develop- 
ment of a general measure of self-efficacy. 
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Consequences of Self-Efficacy for Individual Functioning 


There is a great deal of research on the consequences of self-efficacy for 
individual functioning and well-being (for reviews, see O’Leary 1985, Ban- 
dura 1986: Ch. 9, Wallston & Wallston 1978, Strickland 1978). In general, 
this research indicates that high self-efficacy has beneficial and therapeutic 
-~ consequences for individuals, and low self-efficacy (powerlessness) has nega- 
tive and maladaptive consequences. Research based on Bandura’s theory has 
found self-efficacy to be a significant factor in various health-related be- 
haviors: overcoming various phobias and anxieties (Bandura et al 1980, 
1982); alcohol and drug abuse (Seeman & Anderson 1983, Newcomb & 
Harlow 1986); eating disorders (Schneider & Agras 1985); overcoming smok- 
ing addiction (DiClemente 1985); and increasing pain tolerance (Neufeld & 
Thomas 1977). Efficacy and control also play a central role in Kobasa’s 
(1982) concept of “hardiness.” Recovery from illness or injury seems to be 
better and more rapid for those individuals who have strong beliefs in their 
physical efficacy (see Schwalbe & Gecas 1988, for a review). Taylor (1985) 
reported impressive findings on the therapeutic effect of perceived “cardiac 
efficacy” (i.e. the belief that one’s heart is working well) in postcoronary 
rehabilitation. The accumulating evidence suggests that Reece ery makes a 
difference in matters of health, illness, and recovery.” 

The beneficial effects of self-efficacy seem to occur through changes i in the 
individual’s behavior toward self and environment. Seeman & Seeman (1983) 
found individuals with high self-efficacy to be more likely to initiate pre- 
ventive care, to seek early treatment and be more optimistic about its 
effectiveness, to rate their health as better, and to be sick less often. Efficacy 
beliefs have also been found to affect heart rate and blood pressure (Bandura 
et al 1982), levels of exercise, and sticking to difficult medical regimes 
(O’Leary 1985). 

There are also suggestions in the health literature that self-efficacy (and 
other positive attitudes and states of mind) might work in more mysterious 
ways to affect health or recovery from illness, perhaps directly affecting the 
endocrine system or the immune system. Norman Cousins’ (1979) personal 
account of his battle with a serious, and usually fatal disease, described the 
turning point in his illness as occurring when he rejected the passive role of 
patient and insisted on being an active participant in his own therapy. In the 


*While the determination of causality is never certain in social psychological research, we are 
on fairly firm ground in assuming the causal influence of self-efficacy as reported in these studies 
since most of this research is based on experimental or longitudinal designs. Still, in “real life” 
the relationship between self-efficacy and various psychological and physical states is un- 
doubtedly reciprocal, e.g. feelings of inefficacy can lead to depression, and being depressed can 
contribute to one’s sense of inefficacy. 
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medical literature, there are numerous other such inexplicable accounts of 
“spontaneous remissions,” “miracle cures,” and “placebo effects,” in which 
self-efficacy, strength of belief, or state of mind seem to play a key role. But 
how is still a mystery. 

Based on this research, it is easy to overstate the therapeutic consequences 
of self-efficacy. But there are limits to its salutary effects. Research on cancer 
patients finds that beliefs about efficacy don’t make much difference in 
determining the course of the disease (Cassileth et al 1985; and see Meyero- 
witz et al 1986 for a review). Those who score high on internal control or 
beliefs about personal efficacy do not seem to fare any better than those with 
low self-efficacy or an external sense of control. Wortman & Dunkel-Schetter 
(1979) suggest a strong sense of internal control may not be helpful in 
situations in which personal control over outcomes is minimal, since patients 
may become discouraged when their beliefs in personal efficacy are chal- 
lenged by their inability to control the disease. Even worse, internals or those 
high in self-efficacy may be more likely to blame themselves for their 
deteriorating physical condition. Another situation in which self-efficacy is 
not associated with favorable health outcomes is in the context of the “type A” 
personality (Matthews 1982, Friedman & Booth-Kewley 1987). Among other 
things, Type A personalities tend to be high in self-efficacy and internal 
control. They are also at high risk for heart attacks, ulcers, and other 
stress-related ailments. These negative health outcomes, however, may also 
be due to some of the other characteristics associated with Type A personali- 
ties, such as hostility, impatience, and inability to relax. 

Self-efficacy has been. found to be important to mental health, particularly 
as it affects depression. In Seligman’s (1975) theory of “learned helpless- 
ness,” depression is likely to occur when one comes to believe that one’s 
actions have no effect on one’s environment, and when the individual attri- 
butes this lack of efficacy to personal failure rather than to universal con- 
ditions (Abramson et al 1978). In much of the research on self-efficacy and 
depression, self-efficacy serves a mediating or buffering role between some 
type of stress and depression. Pearlin et ‘al (1981) found job disruptions and 
economic strains to contribute to depression mainly through their negative 
effects on self-efficacy. Similarly, the depressing consequences of physical 
injury and disability have been found to vary depending on their effects on 
self-efficacy (Turner & Noh 1988, Gecas et al 1988, Schulz & Decker 1985). 
Other aspects of mental health found to be affected by self-efficacy, or-rather 
the lack of it, are paranoia and schizophrenia. Mirowsky & Ross (1983) argue 
that feelings of powerlessness combined with the threat of victimization and 
exploitation tend to produce paranoia. They found these conditions to prevail 
for those of low socioeconomic status, of Mexican heritage, and for women. 
Some forms of schizophrenia have been described as involving intense feel- 
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ings of inefficacy, particularly from the inability to connect words and acts, or 
symbols and actions (Becker 1964). It is easy to see how feelings of in- 
efficacy, powerlessness, incompetence, and victimization can have negative 
consequences for one’s mental health. It is not surprising therefore that people 
engage in distortions of reality and often operate under the illusion of personal 
control and efficacy (Langer 1975), thereby sometimes increasing their sense 
of well-being (Alloy & Abramson 1979, Lewinsohn & Mischel 1980). 


Development of Self-Efficacy 


Self-efficacy, as encompassing the individual’s sense of control and beliefs 
about causality, develops out of the early interactions between individual and 
environment. Specifically it is the environment’s responsiveness to the in- 
fant’s actions that is critical in the development of a sense of self and a 
conception of one’s world—a central theme in the developmental theories of 
Piaget (1954) and Mead (1934). Gurin & Brim (1984:283) describe this early 
drama as follows: 


Through interacting with the world the infant early on begins to grasp an understanding of 
causality. . . . In those moments when the infant tries to make interesting experiences 
continue, one of life’s critical dramas takes place. During this drama the infant’s earlier 
nearly exclusive dependence on external causation gives way to a sense of self as a causal 
agent. . . . Causing things to happen is the medium through which the young infant begins 
to exercise control and to understand cause and consequence more fully. 


It is the quality of the individual-environment interaction, primarily with 
regard to the opportunities it provides for engaging in efficacious action (e.g. 
a stimulating, challenging, responsive environment and the freedom to en- 
gage it), that continues to be the major condition for self-efficacy throughout a 
person’s life. Beyond infancy, however, other sources (that depend more on 
symbolic interaction) emerge as consequential. Besides performance accom- 
plishments, Bandura (1977) discusses three other sources of efficacy informa- 
tion: vicarious experience (i.e. seeing others perform challenging or threaten- 
ing activities successfully); verbal persuasion (i.e.information from others 
about one’s abilities); and emotional arousal (i.e. inferences individuals make 
about their capabilities from their emotional states, such as fear). The most 
effective source of efficacy information, for Bandura, is personal mastery 
experiences (i.e. performance accomplishments). 

Initially, the family context is the most important environment for develop- 
ing self-efficacy, and parents are the most significant aspects of this environ- 
ment for the child. Research on parent-child interaction has consistently found 
that parental responsiveness, support and encouragement, use of inductive 
control (which relies more on reason than on coercion), and high achievement 
demands are significantly related to the development of children’s self- 
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efficacy, and a number of other positive socialization outcomes—such as, 
self-esteem, achievement motivation, interpersonal competence, and pro- 
social behavior (Baumrind 1975, Baumrind & Black 1967, Gecas 1971, 
Gecas & Schwalbe 1986, Thomas et al 1974, Mortimer et al 1986: ch. 2, and 
see Rollins & Thomas 1979, for an extensive review). In particular, it seems - 
that the combination of parental support/encouragement with the use of 
inductive control is most conducive to the development of competent, effica- 
cious children, who also perceive themselves as competent and efficacious 
(Baumrind 1977, Gecas 1971).° There is also some evidence that modeling is 
involved; children may learn to be efficacious by modeling efficacious parents 
(Whitbeck 1987). 

However, it should be noted that these relationships are much stronger for 
the child’s perceptions of parental behavior and his/her self-efficacy than for 
actual parental behavior (or parental reports of their behavior) (Gecas & 
Schwalbe 1986). Furthermore, Gecas & Schwalbe (1986) and others (Demo 
et al 1987) found that there isn’t much overlap between parental reports of 
their behavior on dimensions of support and control and the children’s 
perceptions of this parental behavior. Consistent with symbolic interaction 
interpretations of socialization, these studies suggest that children’s self- 
efficacy, and other aspects of self-conception, are more responsive to chil- 
dren’s interpretations of family “realities” than to those realities as viewed 
from others’ perspectives. Bandura et al (1982) makes a similar point when he 
states that self-efficacy is enhanced not so much by performance successes as 
by how people perceive these successes. | 

Family structure may also affect the development of children’s self- 
efficacy. Rodin (1976) found that family size, in terms of density of living 
conditions, had a negative effect on children’s sense of personal control. She 
suggests that the increase in density of one’s living conditions also increases 
the probability that others will infringe on one’s “space” and that events may 
be more unpredictable. It might also be the case that as family size and density 
increase, parental control becomes more rigid and authoritarian (Elder & 
Bowerman 1963), thereby restricting the freedom and opportunities for effica- 
cious action for children. It may also be restrictive for parents—Duncan & 


3Baumrind (1977) also reported some findings, from her study of a small sample of 8 and 
9-year-old children, of an interesting sex difference in this socialization pattern. She found 
personal agency in girls to be associated with a childrearing pattern of firm control and somewhat 
stressful, abrasive interactions, particularly with their fathers. Also, parental warmth was nega- 
tively related to personal agency in girls. The pattern of these parental variables and personal 
agency was opposite to that found for boys. Her explanation for these surprising findings is that 
the cultural emphasis for women’s sex-role includes dependence and lack of assertiveness which 
nurturant socialization merely encourages. In order for girls to develop personal agency they need 
a more abrasive, demanding environment than do boys (for whom the cultural emphasis already 
encourages self-efficacy and agency). This relationship, however, has yet to be corroborated by 
other researchers. 
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Morgan (1980), in a large study of American families (N = 2000), found 
increase in family size to have negative effects on self-efficacy of fathers. 


The Effects of Social. Structure on Self-Efficacy 


The major thrust of sociological research on Self-efficacy deals with its 
connection to social structure. ‘The aspect of social structure most frequently 
considered is social stratification, or more specifically, social'class. Social 
class is relevant to the development of self-efficacy because it has a bearing 
on individuals’ opportunities to engage in efficacious action, particularly in 
the workplace, and because of its relationship to ideological belief sys- 
tems which differentially evaluate individuals’ actions (Gecas & Schwalbe 
1983). 

As mentioned earlier, much of the research on social structural antecedents 
of self-efficacy takes its lead from Marx’s ideas about social class and 
alienated labor. Early work in this area (1960s to early 70s) related indexes of 
industrialization or “modernization” to worker satisfaction and/or alienation. 
The operationalization of alienation was mostly guided by Seeman’s (1959) 
influential conceptualization, which treated the concept as comprised of five 
components: powerlessness (the aspect most relevant here); normlessness; 
meaninglessness; social isolation; and self-estrangement. Indexes of in- 
dustrialization—such as level of technology, extensiveness of division of 
labor, degree of bureaucratization, degree of mechanization or automation— 
were then related to worker alienation/satisfaction. Companies, industries, 
and sometimes societies which differed on the indexes of industrialization 
were then compared with regard to worker alienation/satisfaction. The find- 
ings were mixed: some found a curvilinear relationship between indicators of 
industrialization and alienation (such as Blauner 1964); some found a positive 
linear pattern (Susman 1972); and some found no relationship (Form 1973). 
This is hardly surprising, since the indicators and measures of industrializa- 
tion and alienation used, as well as the types of industries and work organiza- 
` tions studied, tended to vary considerably across these studies (for reviews of 
this research, see Shepard 1977; and Seeman 1975).* 

Recent work has delimited its approach to social structure, using specific 
indicators of socioeconomic status and occupational conditions, and has 
focused more explicitly on self-efficacy or related terms. Socioeconomic 
status, as measured by occupational prestige and education, generally is 
positively related to self-efficacy and mastery. and negatively related to 
powerlessness. These relationships are not very strong, but they are consistent 


*This line of research was generally condemned by Marxist scholars on at least two grounds: 
(a) it focused on the “wrong” independent variables, e.g. indexes of industrialization rather than 
the capitalist system of economic production; and (b) it distorted the concept of alienation—from 
Marx’s concepts of “alienated labor” and “reification” to Seeman’s psychologized version (see 
Israel 1971, and Schwalbe 1986). l 
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(Gurin et al 1978, Mirowsky. & Ross 1983; Wheaton 1980, Gecas & Seff 
1987). 

The more interesting work on these relationships has focused on occupa- 
tions, with analyses of how the nature and conditions of work affect workers’ 
self-efficacy. The most prominent work in this area is that of Melvin Kohn 
and his colleagues (1969, 1973, 1976, 1983). In a series of studies spanning 
several decades Kohn examined how occupational.conditions affect the values 
and psychological orientations of workers. He focused on conditions of work 
that enable or inhibit self-direction (i.e degree of supervision, degree of 
routinization, and the substantive complexity of the job). These occupational 
conditions, which are highly related to social class (or rather socioeconomic 
status), Kohn argues, constitute a major means by which social class impinges 
on people via the workplace. Kohn’s research, as well as that of others (see 
Kohn 1977, 1983, and Gecas 1979 for reviews) has found these occupational 
conditions to affect significantly the value of self-direction, degree of in- 
tellectual flexibility and several aspects of personality similar to self-efficacy. 
In general, Kohn finds that the greater the freedom experienced on the job and 
the more complex and challenging the work, the more likely is the worker to 
value individual freedom and self-direction, to be intellectually more flexible, 
and to have greater “self-efficacy.” 

Kohn’s main dependent variables, “intellectual flexibility” and “value of 
self-direction,” have an oblique relevance to the topic of self-efficacy in that 
they at least suggest competence, control, or agency. But several other 
variables examined are more directly relevant: “sense of powerlessness” (used 
by Kohn as an indicator of alienation), “self-confidence,” and “fatalism.”° 
Kohn (1976) found occupational self-direction (a composite measure of 
supervision, routinization, and complexity of work) to be related to 
powerlessness substantially more (r= —.21) than was ownership and hierar- 
chical status in the company (r=—.13)—-variables more likely: to be empha- 
sized by Marxist scholars in studying alienation. Significant relationships 
were also found.between occupational self-direction, particularly on the 
dimension of “substantive complexity of work,” and workers’ self-confidence 
and fatalism. Kohn & Schooler (1973; 1978) have gone to considerable pains 
to show that the relationships between work conditions and these psycholog- 
ical variables are reciprocal. However, this reciprocity is not equal: “In all 
cases, job affects man more than man affects job” (Kohn & Schooler 
1973:114).. 


*Kohn & Schooler (1983) also propose a variable called “self-directedness,” which at first 
glance seems relevant to self-efficacy, but in fact is a conglomerate of various “good” attributes. 
Kohn & Schooler define it as follows: “Self-directedness is reflected in not having authoritarian 
conservative beliefs, in having personally responsible standards of morality, in being trustful of 
others, in not being self-deprecatory, in not being conformist in one’s ideas, and in not being 
fatalistic” (1983:147). It is a variable created less by theory than by factor analysis. 
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Other scholars have also found that work autonomy, flexibility, and com- 
plexity are conducive to the development of workers’ self-efficacy. Mortimer 
& Lorence (1979) and Mortimer et al (1986) found that work autonomy 
significantly affects self-perceived competence in their panel study of male 
college graduates. Staples et al (1984) found degree of routinization and 
supervision at work to affect occupational self-efficacy significantly (nega- 
tively). Gecas & Seff (1987) found that these same occupational conditions 
are related to general self-efficacy for a sample of employed men. Spenner & 
Otto (1985) found “substantive complexity of work” had a small but signifi- 
cant effect on the sense of personal control for men and women. Schwalbe 
(1986), in an imaginative study of workers in five different industries, found 
that problem-solving demands at work were positively related to workers’ 
intellectual flexibility and negatively related to “reification” (i.e. external 
control ideology). It is evident from this line of research that occupational 
conditions that enable efficacious action are conducive to the development of 
self-efficacy. But it is also evident from these studies, in which the correla- 
tions and beta coefficients for these relationships are typically small-to- 
modest (in the .10 to .30 range), that other factors besides work conditions are 
important to self-efficacy. 

Some of these factors are other class-related variables, the most important 
of which is education. Kohn & Schooler (1983) found education to be as 
important as occupational conditions (the two are strongly interrelated) for his 
various personality variables. Mirowsky & Ross (1983) found, as have others 
(Gurin et al 1978), that education increases the sense of mastery and personal 
control. Income is also positively related to self-efficacy (Duncan & Liker 
1983, Downey & Moen 1987), especially when the income is earned rather 
than received through charity or welfare (Duncan & Morgan 1980). Not 
surprisingly, disruptions in employment status—being laid off, downgraded, 
losing one’s job because of injury—decrease workers’ sense of efficacy and 
increase powerlessness and associated psychological distress (Pearlin et al 
1981, Gecas et al 1988). Along with the various undesirable consequences of 
lower income, poorer education, poorer work conditions, and more uncertain 
employment status, the greater sense of inefficacy and absence of control 
among the lower classes is one of the “hidden injuries” of social class (Sennett 
& Cobb 1972). 

Some evidence suggests that race and ethnicity affect self-efficacy and 
perceived control, beyond their association with social class (Gurin et al 
1978, Lachman 1985). Gurin et al (1978) found whites, particularly white 
men, to have a significantly higher sense of personal control than blacks. 
However, they did not find any difference between the races on “control 
ideology,” that is, the belief in the degree to which people in general have 
control over their lives. This is somewhat surprising in that control ideology, 
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in the form of “system blame,” has been suggested as a psychological defense 
used by the powerless to account for their inferior status (Parkes 1971, Taylor 
& Walsh 1979)—more on this later. The lower self-efficacy of blacks, when 
compared to whites, seems to be a function of racial discrimination and its 
consequences for power, control, and access to resources enabling efficacious 
action. Interestingly, these conditions do not seem to affect black self-esteem 
in the same way—generally studies have found little difference in the self- 
esteem of blacks and whites (see Gecas 1982 and Rosenberg & Rosenberg, in 
press, for reviews). One explanation of these differences is that self-esteem is 
primarily dependent on one’s interpersonal context (i.e. local contexts of 
interaction and frames of reference), whereas self-efficacy is more responsive 
to social structural influences (Gecas & Schwalbe 1983, Gecas 1988, and see 
Hughes & Demo 1989, who provide some empirical evidence for this 
speculation in their study of the self-perceptions of black Americans). 

Cultural influences on self-efficacy are somewhat more evident than social 
structural influences in ethnic comparisons, although both sources un- 
doubtedly play a part. Cultures, subcultures, and ideologies that emphasize 
fatalism or fatalistic beliefs are associated with lower self-efficacy and beliefs 
in external control. Much of the evidence for cultural influences on self- 
efficacy comes from comparisons of Mexican-Americans and Anglos, in 
which Mexican heritage is found to affect beliefs in external control even after 
the effects of social class are controlled (Madsen 1973, Mirowsky & Ross 
1983, Ross et al 1983).° Cultural influences on self-efficacy are also relevant 
to examinations of sex roles and gender differences, discussed below. 


Sex Roles and Gender Differences in Self-Efficacy 


Research in child development as well as sociology indicates that males have 
a greater sense of self-efficacy, personal control, and mastery than do females 
in our society. Several extensive reviews of the psychological (de- 
velopmental) research on sex differences (Maccoby & Jacklin 1974, and 
especially Block 1976, 1983) indicate that boys have a more potent self- 
concept than do girls, as well as that they score higher than girls on aggres- 
sion, activity level, and impulsivity. Girls show more evidence than boys of 
learned helplessness in achievement situations (Dweck et al 1978). The 
opportunity to exert control over external events is found to be a more salient 
issue for boys than for girls (Gunnar-Von Gnechten 1978). Block 
(1983:1339-40) in her excellent review of the psychological literature on sex 
differences in self-efficacy concludes: 


The effect of cultural orientations on personality and behavior (with beliefs regarding 
personal control as one of the key elements) became a politically volatile issue in the “culture of 
poverty” controversy of the 1970s (see Gecas 1979 for a review of this literature and its relevance 
to social class and socialization concerns). | 
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The self-images of males, in contrast to those of females, include stronger feelings of being 
able to control (or to manipulate) the external world. . . . Males describe themselves as 
more powerful, ambitious, energetic, and as perceiving themselves as having more control 
over external events than females. . . . The self-descriptions of males, more than those of 
females, include concepts of agency . . ., efficacy . . ., and instrumentality—all reflections 
of a self-concept in which potency and mastery are important components. In contrast, 
females describe themselves as more generous, sensitive, nurturing, considerate, and 
concerned for others. . . . The self-concepts of females emphasize interpersonal relations 
and communion . . . and do not emphasize competition and mastery. 


Most of the research on gender differences in self-efficacy is on children. 
But these sex differences are also found in studies òf adults (Gurin et al 1978, 
Mirowsky & Ross 1983, Ross et al 1983, Lachman 1985). They tend to 
fluctuate, however, by age and stage of family life cycle. The largest sex 
differences between men and women on self-efficacy scores (reflected in 
male-instrumental and female-expressive scores) occur during the parenting 
stages in the family life cycle. By contrast, postparental men and women are 
more likely to identify cross-sex traits in their self-conceptions, making sex 
differences in self-efficacy less evident with increasing age (see Bengtson et 
al 1985 for a review of this research). 

The most common explanations for these differences involve cultural 
factors (e.g. sex-role stereotypes), structural factors (e.g. the structure of 
social environments of males and females), or both. At the level of cultural 
explanations, these sex differences reflect cultural conceptions of “masculin- 
ity” and “femininity” (as expressed, for example, in Bem’s 1974 scale), with 
their stress on agency, potency, and assertiveness in the former, and passiv- 
ity, dependence, and conformity in the latter. Sex-role socialization, there- 
fore, also implies socialization into different conceptions of self-efficacy for 
boys and girls. This is reflected in the toys and games boys and girls play, 
with their differential opportunities for the development of efficacy (Block 
1983). Structural explanations for these sex differences in self-efficacy point 
to power differences between men and women in society, women’s greater 
dependency and restricted opportunities, and the nature of housework and 
women’s jobs (Mirowsky & Ross 1986). Cultural conceptions of sex-roles, 
however, are clearly undergoing considerable redefinition and there is pres- 
sure for change. The extent to which these will be reflected in changed 
self-conceptions regarding efficacy and personal control depends to a large 
extent on how structures of interaction in family and workplace change. Such 
historical changes in gender differences in self-efficacy are yet to be docu- 
mented. 


Self-Efficacy Over the Life Course 


There is relatively little research on variations in self-efficacy over the life 
course, mostly because of the difficulty and expense of longitudinal research. 
What there is, however, suggests that it is a fairly stable self-assessment. 
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Mortimer et al (1986), in their ten-year panel study of a sample of college 
men, found a stability coefficient of .73 (in their path analysis) for “self- 
competence.” This is similar to the stability coefficient for “personal efficacy” 
of .78 reported by Gurin & Brim (1984) for their two-year panel study. 
Although these stability coefficients are high by the standards of social 
psychological research, they still suggest considerable room for variation in 
self-efficacy over time (approximately 50% variation). 

When self-efficacy has been considered over the life course (usually based 
on cross-sectional analyses of different.age groups), a curvilinear pattern has 
typically been found, with efficacy increasing through childhood and early 
adulthood, reaching a peak in middle age, and gradually declining after age 
sixty or so (Dowd 1975, Brim 1974, Gurin & Brim 1984, Woodward & 
Wallston 1987). Contrary to this pattern, Lachman (1986) found no evidence 
of decline in personal efficacy in later life. In a large sample of adults 
(N= 1212) with an age range of 36 to 67, Lachman found personal efficacy to 
. decrease in early middle age, remain stable through the remainder-of middle 
age, and to increase in the early 60s. Whether this finding was due to the 
lower expectations the elderly had for themselves (compared to the younger 
age groups), fewer responsibilities, sample peculiarities, or cohort effects was 
not clear. 

It is not only because the life course is associated with aging per se that we 
would expect to find some variation in self-efficacy, but also because of its 
association with life events. Normative life events (e.g. school graduation, 
marriage, parenthood, retirement, etc) are less likely to affect changes in 
self-efficacy than are atypical and off-time events because, as Gurin & Brim 
suggest, they heighten awareness of self: “Unlike typical, age-graded events, 
which tell more about the environment than about the self, off-time events tell 
people something about themselves—that it is their actions, personalities, 
something about them that caused the event to happen” (1984:315). For 
example, it is not retirement in itself, or marriage, or parenthood that is 
significant for changes in self-efficacy, but whether these events occur un- 
typically early or late (see McLaughlin & Micklin 1983 on the timing of the 
first birth and: its consequences for self-efficacy). The possible negative 
consequences of off-time events may be due to unfavorable social com- 
parisons with one’s age-mates who are not experiencing the event, and with 
those younger or older for whom the event is more common and therefore less 
stressful (Gurin & Brim 1984:320). In general, life events that increase role 
strain (Pearlin 1983), economic or social disruption (McLanahan 1983), or 
involve physical impairment (Gecas et al 1988) are likely to have a negative 
effect on self-efficacy. How individuals respond to adversity depends on a 
number of factors (including their initial sense of personal efficacy) which 
will determine any long-term effects. For example, Elder’s (1974) research on 
the consequences of the Great Depression showed that economic adversity 
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had some positive psychological consequences (regarding the development of 
competence and self-confidence) on children in the older cohort. Economic 
adversity had a positive effect (years later) on the older cohort because they, 
unlike the children in the younger cohort, could demonstrate competence and 
adaptive capacity in dealing with this crisis. 

The pattern of development of self-efficacy over time seems to reflect a 
high level of reciprocity between self and environment. Mortimer et al 
(1982), m a sophisticated longitudinal study of 368 men, demonstrate how 
early self-competence (in college) shaped these men’s life events in the areas 
of work and family, and how these life events, in turn, had an independent 
effect on self-competence ten years later. In this same vein, Gecas & Morti- 
mer (1987) observe that self-evaluations of efficacy indicate a pattern of 
spiralling success and failure. “Initial conceptions of self-efficacy, from 
whatever source, tend to become self-fulfilling prophecies, encouraging the 
taking of risks and giving confidence in the undertaking of new and challeng- 
ing tasks. Subsequent success in these endeavors fosters an increasing sense 
of personal efficacy over time” (1987:278). Clausen’s (1986) analysis of the 
Berkeley Longitudinal Studies found a strong relationship between “planful 
competence” in adolescence and various indicators of success in adulthood. 
Clausen’s research suggests that the negative pattern of reciprocity also 
operates: low self-competence in adolescence leads to poor decisions and poor 
coping abilities which adversely affect later self-competence and other per- 
sonality characteristics. 

Along with this pattern of reciprocity and a generally high level of stability, 
several other observations can be made about self-efficacy over the life 
course. Based on an extensive review of research on aging and self- 
conceptions, Bengtson et al (1985) conclude that cohort membership, gender, 
sociocultural trends, and life-stage experiences have more significant impact 
on self-conceptions (including self-efficacy) than does maturation. Further- 
more, since many of these studies are cross-sectional in design, Bengtson et al 
(1985:573) observe, age contrasts may in fact reflect cohort differences rather 
than maturational changes. They also conclude that subjective perceptions of 
self show more change with age than do objective assessments. 


The Consequences of Self-Efficacy for Social Stability and 
Change 


The effect of self on society is the most difficult and neglected connection 
within social psychology, though it is clearly important (Coleman 1986, 
Giddens 1979, Turner 1988). Self-efficacy may be a strategic concept for 
developing this connection. There are at least two lines of research and theory 
in which self-efficacy has an important role regarding social stability and 
change: legitimation of stratification or status inequalities; and political activ- 
ism. 
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Della Fave (1980, 1986a,b) offers a provocative theory of legitimation 
based on what he calls “self-evaluation theory.” Drawing on equity theory 
(Blau 1964), status attribution theory (Berger et al 1972) and the self-theories 
of Mead (1934) and Bem (1972), Della Fave proposes that people accept their 
locations within structures of inequality because they believe that their status 
is deserved. He argues that people develop favorable or unfavorable self- 
evaluations in proportion to their relative positions of advantage or dis- 
advantage in the stratification system. In other words, individuals infer from 
their positions in the system the worth of their contributions to the system, 
upon which they base their self-evaluations and in turn their notions of the 
fairness of the system of stratification. This process, Della Fave maintains, is 
a major mechanism in the legitimation of social inequalities and therefore in 
the stability of social systems. 

It is an appealing theory which has stimulated research, but the theory has 
also led to some confusion, particularly around the central concept of “self- 
evaluation.” Della Fave argues that “self-evaluation” in his usage does not 
refer to self-efficacy or self-esteem, which he considers much broader con- 
cepts. But his description of “self-evaluation” sounds very much like self- 
efficacy: “Self-evaluation is not to be confused with global self-esteem or 
self-efficacy. Self-evaluation is specific-to a person’s definition of his or her 
ability to control the larger social environment, the kind of control made 
possible by the accumulation of wealth and power in the major institutions of 
society, namely the economy and the state” (1986b:441). Furthermore, in 
commenting on research which has found only modest relationships between 
social class and self-esteem, Della Fave states: “Thus, their findings of a 
positive correlation lead me to believe that if a measure more specific to 
self-evaluation in terms of mastery of, and confidence in, abilities related to 
success in instrumental activity in social institutions were used the correlation 
would have been all the stronger” (1980:962). Clearly Della Fave has in mind 
self-evaluations in terms of power, resources, and perceived control. 

Tests of Della Fave’s theory of legitimation processes have produced mixed 
and not very encouraging results. Explicitly interpreting “self-evaluation” as 
self-efficacy, Stolte (1983) tested Della Fave’s propositions in an ex- 
perimental study of small group interaction. He found, as predicted, that 
positional power has a significant effect on self-efficacy. However, his data 
did not support the prediction that perceptions of fairness (of resource distri- 
butions) would not vary with the actor’s relative positional power, that is, the 
prediction that those in “disadvantaged” positions would think the system of 
rewards to be as fair as would those in “advantaged positions.” Rather, he 
found a significantly positive relationship between position in the “stratifica- 
tion system” and perceptions of fairness. Similarly, Shepelak (1987), using 
survey data, failed to support the proposition that disadvantaged persons 
believe that they deserve less. Both Shepelak and Stolte suggest refinements 
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and modifications of the theory, including a more explicit focus on self- 
efficacy and the specification of the conditions under which it is likely to lead 
to political action. Della-Fave (1986a) himself suggests that there are two 
processes at work, one based on equity (favoring legitimation and social 
stability), and a second based on equality (favoring delegitimation and social 
change). It might be the case that the subjects in the Stolte and Shepelak 
research were operating more on the principle of equality than of equity. 

Indeed, research on political activism suggests that it is the combination of 
high self-efficacy and perceptions of system unresponsiveness and high out- 
come expectations that generates not only resentment but also efforts at 
political change (Bandura 1986; Ennis & Schrener 1987, Marsh 1977, Gurin 
& Brim 1984). Paige (1971) found that those who participated in the Newark 
riot of 1967 tended to score high on political efficacy but low on political trust 
(i.e. percetved system responsiveness). Trust in government declined sub- 
stantially during the 1960s and 1970s (Gurin & Brim 1984), Lipset & 
Schneider 1987), providing a favorable context for activism and political 
confrontation. Gurin & Brim (1984:288) observed among challengers of the 
system that judgments of self-efficacy and system responsiveness were in- 
versely related: “In these groups, especially among minorities, the view of the 
self as competent and the system as unresponsive related to political action of 
both a traditional and nontraditional sort.” 

Concerted political action may also depend on perceptions of the group’s 
efficacy, or what Bandura calls “collective efficacy” (1986:449-52). Per- 
ceived collective efficacy refers to members’ judgments about their group’s 
capabilities to engage in successful political action. It is an important element - 
in sustaining members’ commitment to their cause. Its opposite is “collective 
powerlessness,” which is conducive to political inaction and apathy. 

Collective efficacy and collective powerlessness can also affect personal 
efficacy. For that matter, participation in collective behavior, such as political 
activism, may itself increase feelings of personal efficacy, especially if the 
actions are viewed as successful. The causal direction between self-efficacy 
and political activism in these studies is by no means clear, since these studies 
are all based on cross-sectional research. Here as elsewhere, the relationship 
between self-efficacy, behavior, and social environment is highly reciprocal. 


Conclusion 


This necessarily brief review of the social psychology of self-efficacy sug- 
gests that this is an increasingly active area of theory and research within 
several disciplines. Self-efficacy seems to have a somewhat more secure place 
within psychological social psychology, but it is also increasingly prominent 
within the sociological branch, often expressed in concepts other than self- 
efficacy but which nevertheless deal with the themes of agency, personal 
control, and perceived competence. The focus on self-efficacy in the social 
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sciences is pervasive, if sometimes hidden—a situation similar to Seeman’s 
(1983) observations regarding the concept of alienation (and, for that matter, 
overlapping his argument, since “powerlessness” is one of the dimensions of 
alienation). 

Reviewing the research on self-efficacy one gets the strong impression that 
high self-efficacy.is good to have. It leads to favorable or beneficial con- 
sequences for the individual and even for society, such as better physical and 
psychological health, creativity, cognitive flexibility, better problem-solving 
and coping skills, higher self-esteem, greater involvement in political pro- 
cesses, etc (see Deci & Ryan 1987 and O’Leary 1985 for particularly favor- 
able reviews which exalt the virtues of self-efficacy)—although the direction 
of causality is not always clear and is probably reciprocal in most situations. 
Even the illusion of efficacy and control (and often it is only that) seems to be 
beneficial. 

However, there may be an element of cultural relativism in all this. 
Certainly the theme of self-efficacy is congruent with the Western (especially 
American) emphasis on self-reliance, individualism, freedom, mastery, and 
personal achievement—values which are even reflected in many of the pop- 
ular “self-help” books (Biggart 1983). Self-efficacy may not be as central a 
concern, nor as important to physical and mental health, in other cultures, 
especially those with a more communal and less individualistic ethos (see 
Weisz et al 1984 for a comparison of America and Japan on the importance of 
sense of control). Clearly, more cross-cultural research on the antecedents, 
consequences, and types of self-efficacy would be useful. Also needed is 
research which more clearly determines the parameters of self-efficacy’s 
beneficial consequences, that is, under what conditions might high self- 
efficacy and personal control be dysfunctional for the individual? Some of the 
cancer research reviewed above is suggestive, but much more needs to be 
done along these lines. It is also unclear how and why self-efficacy changes or 
remains stable over the life course, how it is related to other aspects of self 
and personality, how context-specific assessments of efficacy are related to 
one’s general sense of self-efficacy, and how sociohistorical circumstances 
affect the development and consequences of self-efficacy. Much has already 
been done on the social psychology of self-efficacy and, clearly, much more 
should be done. 
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Abstract 


Recent work in the sociology of emotions has gone beyond the development 
of concepts and broad perspectives to elaboration of theory and some empiri- 
cal research. More work has been done at the micro-level than the macro-level 
of analysis. At both analytical levels, emotion most commonly is treated as a 
dependent variable, although increasingly, its role as an intervening and 
independent variable in social processes is being recognized, especially with 
regard to problems ‘in substantive fields as diverse as gender roles; stress, 
small groups, social movements, and stratification. Considerable gaps exist in 
sociological knowledge about emotions; in particular, little is known about 
distribution of different emotional experiences in the population, the content 
of emotion culture, emotional socialization processes, emotional] interactions, 
and relationships between social structure and emotion norms. More empiri- 
cal research is necessary, to build on the theoretical groundwork that has been 
laid. Problems in measuring emotional experience and aspects of emotion 
culture have not been addressed and are likely to become critical issues as 
empirical work accumulates in the future. 


INTRODUCTION 


Emotion is a relatively new substantive topic within sociology. Growing 
interest in emotion in evident in the establishment of a Sociology of Emotions 
Section within the American Sociological: Association in 1986, and the recent 
proliferation of paper presentations and publications (see chapter references). 
Increasing attention to the topic is likely due to the recognition that humans 
are not motivated solely by rational-economic concerns. Emotional attach- 
ments to others and affective commitments (e.g. desires, attitudes, values, 
moral beliefs) influence a significant portion of human behavior (Etzioni 
1988, Hochschild 1975). 
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Two previous reviews of this developing field have been published (Gor- 
don 1981, Scheff 1983). These emphasize broad conceptual problems and 
perspectives, reflecting the initial stages of work in a new topic area. In this 
chapter, I focus on elaborated theories of emotion in general (as opposed to 
theories about specific emotions), discuss some empirical findings, and high- 
light researchable questions that have not been addressed. Due to space 
limitations, I am unable to review articles that trace emotions issues in earlier 
sociological thought, that use affect to integrate disparate theoretical 
approaches, that are suggestive for, but only indirectly related to, emotions 
per se, and that are emergent but as yet undeveloped emotion theories. I also 
restrict this review to sociological work (drawing primarily from US sociolo- 
gy), with only limited references to relevant work in psychology, anthropolo- 
gy, and social history. 

I organize this review by two major approaches to emotion, “micro-level” 
(social psychological) and “macro-level” (structural-cultural). Within these 
broad levels, 1 examine emotion as it is treated theoretically: as a dependent 
variable, as an independent variable, and as an intervening or explanatory 
variable. I give more attention to emotions as micro-level phenomena because 
most theorizing and research efforts have been at this level. 

It is necessary to define emotion as the term is used in this chapter. There 
are almost as many definitions of emotion as there are authors, but most 
definitions refer to its components (e.g. Gordon 1981, Kemper 1978, Thoits 
1984, Frijda 1986). Emotions involve: (a) appraisals of a situational stimulus 
or context, (b) changes in physiological or bodily sensations, (c) the free or 
inhibited display of expressive gestures, and (d) a cultural label applied to 
specific constellations of one or more of the first three components. All four 
components need not be present simultaneously for an emotion to be ex- 
perienced or to be recognized by others; for example, one can be afraid but 
not know why, and young children clearly become emotional (in viewers’ 
eyes) without having acquired the words with which to communicate their 
experience. 

Emotions can be distinguished from feelings, affects, moods, and senti- 
ments. The first two are less specific terms, the latter two, more specific. The 
general term feelings includes the experience of physical drive states (e.g. 
hunger, pain, fatigue) as well as emotional states. Affects refer to positive and 
negative evaluations (liking/disliking) of an object, behavior, or idea; affects 
also have intensity and activity dimensions (Heise 1979). Thus, emotions can 
be viewed as culturally delineated types of feelings or affects. Compared to 
emotions, moods are more chronic, usually less intense, and less tightly tied 
to an eliciting situation. (Moods rarely are examined by sociologists.) Senti- 
ments are “socially constructed pattern[s] of sensations, expressive gestures, 
and cultural meanings organized around a relationship to a social object, 
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usually another person. . . . or group such as a family” (Gordon 1981 :566, 
567); examples of sentiments include romantic love, parental love, loyalty, 
friendship, and patriotism, as well as more transient, acute emotional re- 
sponses to social losses (sorrow, envy) and gains (pride, gratitude). As 
defined by Gordon, the term “sentiment” emphasizes relatively enduring 
social relationships as affect elicitors. The social emphasis in Gordon’s 
definition helps focus attention on what is important to sociologists about 
emotions, namely their social antecedents and their social acquisition and/or 
shaping. 

Along with most sociologists, I assume that emotions are not simply innate, 
biophysiological phenomena. Evidence indicates (see Ekman 1982) that cer- 
tain “basic” or “primary” facial expressions are culturally universal and thus 
probably innate. The basic emotions—interest, fear/surprise, anger, disgust, 
sadness, happiness/pleasure, and (perhaps) contempt (Ekman 1982, Ekman & 
Friesen 1986)—probably evolved as species-specific physiological reactions 
and expressive signals due to their utility for individual and group survival. 
But considerable variability exists historically and cross-culturally in the 
situational causes, experience, meaning, display, and regulation of emotions, 
including the basic ones (e.g. Stearns & Stearns 1986, Lutz & White 1986). 
Historical and cultural variability suggests that, to an important degree, 
subjective experiences and emotional beliefs are both socially acquired and 
socially structured. These processes, which intrigue and occupy sociologists, 
are explored in the sections below. 


MICRO-LEVEL APPROACHES TO EMOTION 
The Shaping of Emotion 


Most commonly, social psychologists treat emotion as a dépendent variable— 
the product. of social influences. This is partially because considerable initial 
debate focused on the degree to which emotions are sociocultural products 
(Kemper 1980, 1981; Shott 1980, Hunsaker 1983). 

Social construction theorists (Averill 1980, Gordon 1989b, McCarthy 
1989, Harre 1986) and symbolic interactionists (Hochschild 1979, 1983; 
Shott 1979) view emotions as primarily dependent on definitions of the 
situation, emotion vocabularies, and emotional beliefs, which vary across 
time and location. In contrast, -“positivists’ (Kemper 1978, Scheff 1979, 
Mazur 1985) view emotions as invariant, automatic, patterned responses to 
particular classes of social stimuli. Similar debates have occurred in an- 
thropology (see Levy 1973, Lutz 1982, Lutz & White 1986) and psychology 
(e.g. Zajonc 1980, 1984; Lazarus 1982, 1984); each debate springs from 
disagreements regarding the specificity of the links among emotional com- 
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ponents—situational stimuli, physiological changes, and expressive be- 
haviors. 

A “strong” version of social constructionism denies the existence of basic 
or primary emotions (Armon-Jones 1986), that is, there are no universally 
patterned links among physiological, expressive, and situational components 
of emotion. All emotions are socioculturally constructed concepts. A “weak- 
er” version of social constructionism accepts the existence of basic emotions 
but deemphasizes their causal importance. Social constructionists generally 
focus on demonstrating the degree to which both basic and nonbasic emotions 
are socioculturally constituted or influenced (Armon-Jones 1986). 

In contrast to constructionists, symbolic interactionists are more willing to 
recognize the influences of basic emotions in human action. They still assign 
considerable weight to sociocultural factors, however. Drawing from Schach- 
ter & Singer’s (1962) research, symbolic interactionists generally assume that 
the bodily changes accompanying emotions (at least the nonbasic emotions) 
are not specific; that is, environmental events elicit generalized arousal 
which, in turn, is interpreted as a particular emotion on the basis of salient 
situational factors. Emotions, then, are the joint product of generalized 
arousal and specific sociocultural factors (i.e. definitions of the situation and 
cultural labels). The same arousal may be experienced as joy or anger, 
depending on available situational cues (Schachter & Singer 1962).' Because 
situational definitions and emotion labels vary across time and cultures, so 
must emotional experiences, according to both constructionists and symbolic 
interactionists. Put another way, the key determinants of emotional experi- 
ence are not physiological but sociocultural. Thus, by extension, as many 
different emotions exist as are distinguished within a culture and between 
cultures. 

In contrast, positivists accept the “specificity” argument proposed original- 
ly by James (1890/1950) and revived more recently by Ekman and his 
associates (1983). James argued that one-to-one correspondences exist among 
the components constituting emotion. Certain classes of environmental stimu- 
li elicit specific autonomic responses, specific expressive behaviors, and thus 


'The Schachter & Singer (1962) study has been criticized on a number of grounds (e.g. 
Kemper 1978), including the failure of similar experiments to replicate its results (see review in 
Thoits 1984). However, the least reasonable proposition in the two-factor approach has been the 
source of most criticism and the cause of nonreplication. The proposition that no emotion will be 
experienced by initially unaroused subjects who are confronted with situational provocations has 
been disconfirmed repeatedly. Schachter & Singer ignored the obvious possibility that initially 
unaroused subjects would become aroused by situational stimuli that were presented. The 
majority of experiments in the two-factor tradition do support the more interesting proposition 
that manipulating situational cues will influence subjects’ interpretations of their physiological 
arousal (Cotton 1981, Reisenzein 1983, Manstead & Wagner 1981). 
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specific emotions.? Culturally universal emotions exist because at least some 
reactions are “wired in.” By extension, there are only a few basic or primary 
emotions; all others distinguished by cultures consist of varying intensities, 
blends, or rapidly alternating sequences of the basic few. 

Recently, Kemper (1987) has attempted an integration of positivist and 
constructionist viewpoints. He suggests that physiologically grounded pri- 
mary emotions become elaborated “through the attachment of social defini- 
tions, labels, and meanings to differentiated conditions of interaction and 
social organization” (1987 :276). “Secondary emotions” (shame, guilt, love, 
anxiety, resentment, and so on) are acquired through the pairing of primary 
emotional experiences (in the learner) with the secondary labels attáched (by 
socializing agents) to the specific circumstances eliciting the primary feeling 
or feelings. 

Although one still might disagree with the positivist premises and/or the 
particular derivations of secondary from primary emotions made by Kemper 
(e.g. guilt from fear, shame from anger, and so on), this more balanced (and 
potentially testable) integration of perspectives may lead to more fruitful 
dialogues among camps regarding how emotions are socioculturally con- 
structed and/or acquired through socialization. Other pressing sociological 
questions, however, need attention: How are various emotional experiences 
distributed across structural positions? What sociocultural factors and pro- 
cesses (e.g. emotional beliefs, differential socialization) influence in- 
dividuals’ understandings of their own and others’ emotional experiences? 
What are the consequences of these understandings for social stability and 
social change? Sociologists have barely begun to scratch the surface of these 
issues, as is seen in each section below. - 


DISTRIBUTIONS OF EMOTIONAL EXPERIENCES Gordon (personal com- 
munication 1986) has pointed out that we lack basic information about the 
“epidemiology” of emotions, both in our own culture and across cultures.’ 
We hold many beliefs about the frequencies and distributions of emotional 
experiences, but these have not been tested systematically. For example, due 


Thus, positivists (e.g. Kemper 1978) also might accept the assertion that situational events 
are the key determinants of emotion. However, positivists assume that certain classes of 
situational events (e.g. decrements in power or status) directly and universally stimulate the same 
emotion because event-emotion links are physiologically grounded, while constructionists and 
symbolic interactionists assume -that the same classes of situational events may be interpreted as 
quite different emotions cross-culturally, regardless of their physiological grounding. (I am 
grateful to Steven Gordon for this observation.) 

3Change in emotions across time also is relevant but is discussed below under “macro 
approaches.” 
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to differential gender socialization, women are thought to be more empathic, 
more loving, and less able to feel (or express) anger, while men are believed 
to feel (or express) fear and sadness less frequently compared to women 
(Hochschild 1981, Peplau & Gordon 1985, Cancian 1987). Negative feelings 
more often may be directed down the status hierarchy (toward less powerful 
individuals), while positive feelings are directed up (toward those more 
powerful) (Hochschild 1979). Thus, higher status individuals (men, whites, 
the married, those of higher socioeconomic status) should be happier and 
more satisfied, since positive emotional experiences should outnumber nega- 
tive ones in experiential frequency. Although some suggestive evidence for 
these hypotheses exists (e.g. Eisenberg & Lennon 1983, Brody. 1985, Peplau 
& Gordon 1985, Ross & Mirowsky 1984, Lombardo et al 1983, Frost & 
Averill 1978, Tavris 1982, Bradburn & Caplovitz 1965, Campbell et al 1976, 
Myers et al 1984), we have very little reliable knowledge about the distribu- 
tions of affective experiences. The frequencies and distributions of various 
feelings across social positions (particularly “everyday” emotions rather than 
pathological states such as depression or phobias) are basic and necessary 
pieces of data for building a sociological theory of emotions. 


BELIEFS ABOUT EMOTIONS We also lack a “social epistemology” of emo- 
tions (Gordon 1989b). Subjective experiences are influenced not only by a 
society’s emotion vocabulary, but by cultural beliefs about emotions. These 
include rules regarding what one should and should not feel or express; 
ideologies about emotions such as romantic love; shared understandings of the 
typical onsets, sequences, and outcomes of emotional experiences and in- 
teractions; socially defined exemption periods from expectations of emotional 
conformity; and beliefs about which emotions can and cannot be successfully 
controlled. These “ethnopsychologies” or “emotion cultures” (Gordon 1989a, 
1989b) barely have begun to be studied. To date, sociologists mainly have 
attended to cultural feeling and display rules as they influence individual 
experience. l 

Hochschild (1979) and Ekman (1982) have elaborated the concepts of 
“feeling rules” and “display rules,” respectively. Feeling rules, or emotion 
norms, refer to beliefs about the appropriate range, intensity, duration, and 
targets of private feelings in given situations. Display rules, or expression 
norms, similarly regulate the range, intensity, etc, of emotional behaviors. 
Using respondents?’ self-reports, participant observation, and experimental 
manipulations, several researchers have demonstrated such normative in- 
fluences on individuals’ feelings and behaviors (e.g. Hochschild 1979, 1983, 
1989a; Saarni 1979; Friesen 1972; Clark 1987). But despite these compelling 
demonstrations, several problems remain. Emotion and expression norms 
have not been identified systematically, the degree of consensus about each 
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norm is unknown, and whether particular norms are viewed as social conven- 
tions or moral obligations has not been assessed. 

A recent exception is notable. Clark (1987) has examined sympathy norms 
and the rules of sympathy exchange, using participant observation, survey 
responses to vignettes, and content analysis of greeting cards, advice col- 
umns, etiquette books, song lyrics, and other media. People generally believe 
that they have a right to sympathy during times of trouble, that sympathy 
should be given to the deserving (i.e. those who are not exaggerating their 
needs and are not at fault for their plight, modified further by the victim’s 
status characteristics relative to the giver), and that recipients of sympathy 
should be grateful or deferent and should reciprocate at some later time. Norm 
violations engender resentment, avoidance, and social condemnation, 
suggesting that sympathy norms carry moral weight. Similar studies of other 
feeling and display norms are needed. Investigators frequently take for 
granted that broad normative consensus exists and that individuals have 
internalized norms as moral obligations. These assumptions may be un- 
warranted. 7 

Emotion cultures consist of more than feeling and expression rules, howev- 
er (Gordon 1989a, 1989b). Societies develop nuanced vocabularies for certain 
feelings, complex ideologies about some emotions, and elaborated 
ethnopsychologies (folk beliefs about affective development and intrapsychic 
processes). Anthropologists have taken the lead in examining these aspects of 
emotion culture, primarily in non-Western and less developed societies (Gor- 
don 1989b, Lutz & White 1986). Sociologists will need to study their own 
subjects’ emotion cultures. Mapping the “cultural universe” of beliefs about 
feelings (e.g. “Women are too emotional to be President”) is an important 
task not only because these beliefs influence individuals’ experiences and 
behaviors, but because they reveal certain macro-level tensions as well (de- 
scribed below). 

Emotion cultures also must be learned. How emotional knowledge is 
acquired and subjective experiences are shaped is the next issue. Here, again, 
major gaps in sociological research are revealed. 


EMOTIONAL SOCIALIZATION Developmental psychologists have begun to 
study the role of various learning processes—reinforcement, role modeling, 
imitation, identification, instruction—in shaping children’s emotional re- 
sponses (Campos et al 1983, Lewis & Saarni 1985). They also are beginning 
to document the ages at which children have acquired certain affective 
knowledge and skills (Lewis & Michalson 1983, Harris & Olthof 1982, 
McCoy & Masters 1985, Saarni 1979). However, psychologists mainly ex- 
amine sex and age-related variations in socialization practices and children’s 
knowledge; ethnic, social class, and structural/contextual variations are over- 
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looked. Sociologists have much to contribute, especially in bringing attention 
to structural factors that should influence socialization processes and out- 
comes (e.g. Gordon 1989b, Kohn 1969, Hochschild 1983, Bernstein 1974). 
For example, societies deliberately structure children’s environments so that 
the experiences of some emotions are more likely than others. The more a 
society’s institutions are aged-graded (one consequence of modernization), 
the less children’s knowledge of emotion should correspond to adults’ (Gor- 
don 1989b). 

The few sociological investigations of emotional socialization that have 
been performed to date have focused on special subgroups, such as disturbed 
adults and children, medical students, athletes, and service workers (Wiley 
1988, Pollak & Thoits 1989, Richman 1988, Smith & Kleinman 1989, Ortiz 
1988, Adler & Adler 1988, Hochschild 1983). These are valuable as detailed 
descriptions of socialization practices and/or socialization outcomes. Howev- 
er, because these studies usually do not include comparison groups, they do 
not illuminate structural influences on the content and style of what is taught 
or learned. For example, Hochschild (1983) has argued that the requirements 
and rewards of service-sector jobs produce workers who specialize in “emo- 
tional labor,” i.e. controlling their own feelings while attempting to enhance 
the positive feelings of others. She further argues that the differential require- 
ments and rewards of blue-collar versus service-sector jobs should affect 
parents’ emotional socialization practices. This in turn implies that middle 
class offspring should be more adept than lower class children at “feeling 
work”—i.e., transforming what they actually feel. But the frequency and 
types of emotion management that are actually employed by blue-collar and 
service-sector workers have yet to be directly compared; consequently, the 
further implications’ of Hochschild’s argument may not hold. 

The preceding discussion implied a “social structure and personality” 
approach to emotion. Individuals’ feelings and their emotion work efforts are 
treated as products of exogenous macro factors, such as organizations, occu- 
pational structures, and broad cultural ideologies. Social structure and per- 
sonality approaches might be called one of the “three faces” (House 1977) of 
the social psychology of emotion. The remaining “faces” (formal social 
psychological theories and symbolic interactionism) emerge below. 


SOCIAL ANTECEDENTS OF EMOTION A different tradition of social psycho- 
logical work examines social circumstances that elicit specific emotions or 
sentiments. This problem has been approached both inductively and de- 
ductively. 

Perhaps the most ambitious qualitative study has been conducted by Scher- 
er and his colleagues (1986). Through content analysis of open-ended ques- 
tionnaires distributed to 779 college students in eight European countries, 


SOCIOLOGY OF EMOTIONS 325 


Scherer et al found strong cross-cultural consistencies in beliefs about 
emotion-eliciting situations: Joy/happiness followed from developing rela- 
tionships with friends, reunions with friends, and success experiences; sad- 
ness/grief followed from problems with friends and the death of loved ones; 
fear/fright followed from traffic situations, physical aggression by others, 
facing the unknown, and‘ achievement-related situations; and anger/rage fol- 
lowed from failures of others to conform to social norms and from “in- 
appropriate rewards for self.” The investigators speculated from these find- 
ings that physical and psychological well-being, relationship concerns, and 
concerns for social order and justice were cross-culturally common personal 
goals or needs that thus evoke the same subjective emotions (see also Frida 
1986). : 

Deductive theories specify fewer and more abstract situational antecedents 
than do those that have been enumerated inductively (also see Averill 1978, 
Shaver et al 1987). Two well-elaborated, formal deductive theories are 
notable: Kemper’s social interactional theory, and Heise’s affect control 
theory. 

Kemper (1978, 1987) has argued that two fundamental dimensions of 
social relationships—power and status—are universal emotion-elicitors. Pow- 
er refers to compliance obtained through coercion or the threat of coercion. 
Status refers to deference given voluntarily or willingly, because actor A has 
high regard for actor B. Specific “structural emotions” are evoked during 
interaction episodes, when relational power and status are either maintained 
or changed. (Mixed emotions often occur because actors always have both 
power and status relations with each other.) For example, possessing suf- 
ficient or adequate power produces feelings of security; receiving sufficient or 
adequate status produces happiness. Individuals feel guilty if they perceive 
themselves possessing excess power, and fear-anxiety if they lack sufficient 
power. They feel shame if they possess excess status, and depression if they 
receive insufficient status. Other emotions depend upon the individual’s 
attributions (self is to blame, other is to blame) for loses or gains of power 
and/or status during an interaction episode. Kemper further proposes (from a 
positivist position) that the emotional outcomes of power and status rela- 
tionships have specific physiological correlates; in short, power and status 
relations are tied to certain basic emotions through specific physiological 
reactions, which is why both power and status dimensions of relationships and 
certain emotional experiences are cultural universals. 

The validity of Kemper’s specificity assumptions, and the many ambiguous 
or inconsistent points in the theory, cannot be discussed here due to limited 
space. Despite these problems, one of the most important aspects of the 
theory is that it isolates abstract structural factors that might reliably influence 
emotional experiences (and subsequent behaviors, if one further assumes, as 
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Kemper does, that unpleasant emotions motivate attempts to restore power or 
status). As such, it is one of the few systematic sociological explanations of 
emotion as a dependent variable. Unfortunately, direct experimental tests of 
the theory have not been made. 

Heise’s affect control theory (1979, 1986, Smith-Lovin & Heise 1988) 
represents another highly systematic, sociological approach. Heise’s goal 
actually is much broader than predicting specific emotions from antecedent 
social situational factors; his goal is to model entire sequences of symbolic 
interaction, using affective dynamics. 

Heise posits that the meanings of settings, actors, personal attributes, 
emotions, and behaviors are primarily affective; in short, every meaningful 
social category is affective. Affect can be assessed on a good-bad dimension 
(evaluation), a powerful-weak dimension (potency), and a lively-quiet dimen- 
sion (activity), yielding specific EPA (evaluation-potency-activity) profiles.* 
College student samples have rated hundreds of nouns, adjectives, and verbs 
on these dimensions, producing average EPA ratings, which Heise terms 
“fundamental” sentiments; they are ratings of social categories “out of con- 
text.” Student samples also have rated actors, settings, attributes, and be- 
haviors on the same three dimensions “in context,” i.e. when combined in 
short phrases or statements describing events (e.g. “The mother soothes the 
child”). In-context EPA ratings are termed “transient” sentiments. When 
transient EPA values (for example, for “mother” and “child”) are close to 
fundamental EPA values (for “mother” and “child’”’), a described event is 
called “confirming”—it conforms affectively to what is expected, con- 
ventional, or normative. When transient sentiments differ from fundamental 
sentiments, the event is “disconfirming” (e.g. “The mother is whipping the 
child” disconfirms fundamental affects about “mother” and “child’). 

Impression-formation equations have been developed to model differences 
observed between transient and fundamental EPA ratings (Smith-Lovin 
1988). These equations can be transformed to predict subsequent behaviors, 
because affect control theory assumes that people are motivated to experience 
transient feelings that confirm fundamental sentiments. For example, the 
equations can be used to solve for the next behavior that would bring transient 
feelings (e.g. about a mother who whips her child) back in line with fun- 
damental feelings (about mother). Generally, when affective expectations are 
disconfirmed, identity-restoring behaviors are expected, although when dis- 
crepancies are extreme, a new understanding of the actor, behavior, recipient 
or setting may occur (just as people in real life reassess the meaning of their 


“Note the similarity of Kemper’s power and status dimensions of relationships and Heise’s 
potency and evaluation dimensions--powerful/weak and likable/dislikable. Both Kemper and 
Heise draw these dimensions from Osgood’s semantic differential research (Osgood et al 1975), 
which indicates that these dimensions of meaning are cultural universals. 
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discrepant experiences). In short, Heise has developed simulation programs 
that not only identify expected and unexpected sequences of behavior, but 
may predict subsequent sequences of behavior from prior events, using 
affective dynamics. 

Importantly, the emotions that must be felt by actors and/or receivers to 
produce observed transient ratings can be imputed from program equations. 
For example, the simulation program INTERACT (Heise & Lewis 1988) 
estimates that the whipped child is feeling “afraid,” “flustered,” or “horrified” 
(Heise 1988). Emotions are treated as consequences of events in affect control 
theory. INTERACT thus would permit systematic explorations of the kinds of 
social situations that produce different emotions. Further, when emotions are 
treated as identity-modifiers in INTERACT (e.g., “angry father,” “sad 
teacher”), they help predict subsequent behaviors toward an object. Thus, 
emotions can be independent variables as well.” : 

An important implication of affect control theory is that individuals need 
not know detailed rules, norms, values, or skills in order to behave in 
socioculturally appropriate ways. “In effect, the affective system provides a 
way for storing much of the cultural heritage of reasoned action. People do 
not have to solve each problem anew, they do not even have to understand the 
logic of old solutions, they only have to acquire sentiments, and then they 
produce reasonable action as if by instinct” (Heise 1988 : 15—16).° Reasonable 
action occurs because affecis are combined and transformed in lawful ways 
that appear to hold cross-culturally (Smith-Lovin 1988). 

Although Heise’s theory offers a powerful explanatory scheme, it radically 
minimizes the complexity of cultural knowledge and of meaning. In the 
theory, culture and meaning are equivalent to affects which are attached to 
Situations, actors, and so on. Individuals’ active, purposive use of their 
cultural knowledge and the inferences individuals draw about themselves and 
others from emotions are missing from Heise’s scheme. This problem is not 
unique to Heise’s approach, as may be seen below. 

Kemper’s and Heises’s theories could be said to represent a “second face” 
of social psychological approaches to emotion: they are formal theories 


SHeise’s theory also might be used to identify emotion norms. The program estimates the 
degree to which actors’ and recipients’ feelings or displays in a described situation deviate from 
those that are expected, or normative. The smaller the deviation, the more normative the feeling 
or expressive display. Further, an approximation to the moral weight of these norms could be 
derived from predictions of recipients’ behaviors in response to an actor’s deviant emotional 
display. If the recipient avoids or punishes the actor, or attributes a deviant identity to the actor, 
then the emotional or expressive expectations that were violated were more than statistical 
conventions. 

This observation is relevant to an ongoing debate within psychology regarding whether 
“preferences need no inferences” or whether some information-processing precedes all affective 
responses (Zajonc 1980, 1984; Lazarus 1982, 1984). 
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explaining emotion itself as a phenomenon (Kemper), or treating emotions as 
key mechanisms linking social structure to individual behavior and vice versa 
(Kemper, Heise). Other formal theories (Scheff 1979, Collins 1981) are 
described below. 


Emotions as Motivators 


Emotion is treated as an independent variable much less frequently in micro- 
level approaches. Most sociologists assume, either explicitly or implicitly, 
that feelings motivate subsequent behaviors, but attempts to specify which 
emotions lead to which behaviors are rare (for exceptions, see Kemper’ 1978, 
Heise 1979, Smith-Lovin & Heise 1988). However, certain types of emotions 
have been linked theoretically to the outcome of social control. 

Shott (1979) delineated a subset of emotions as “role-taking emotions.” 
Reflexive role-taking emotions include shame, guilt, and embarrassment (on 
the unpleasant side) and pride and vanity (on the pleasant side). Reflexive 
emotions motivate self-control. The individual feels proud or ashamed, de- 
pending on the imagined reactions of others to his/her thoughts or behaviors. 
When ashamed, guilty, or embarrassed, the actor inhibits unconventional 
behavior and often behaves altruistically to reduce shame and restore pride 
(self-esteem). Empathic role-taking emotions, or vicarious emotions, result 
from mentally placing oneself in another’s position and feeling what the other 
might feel in that situation; these emotions include empathy, sympathy, and 
pity. Empathic emotions also motivate prosocial behavior—one helps others 
to reduce one’s own vicarious distress. Role-taking emotions are crucial to the 
smooth functioning of society, Shott argues, because societal members cannot 
monitor and sanction each person’s behavior at all times. Role-taking emo- 
tions motivate self-control, which is crucial more generally to social control 
(see also Denzin 1984). 

In a recent article, Scheff (1988) has focused exclusively on the con- 
formity-producing functions of shame. He proposes that individuals are 
almost continually monitoring their thoughts and behaviors from the stand- 
point of others. Two types of shame (overt, undifferentiated shame and 
bypassed [nonconscious] shame) are likely to produce conforming behavior, 
according to Scheff, although individuals with high self-esteem should be 
better able to withstand the shame of nonconformity and thus yield less often 
to imagined social pressure, while those with low self-esteem should be 
overly and rigidly conforming. 

That role-taking emotions produce conformity and/or prosocial behavior 
seems reasonable. Experimental studies of helping and altruism show that 
guilt and embarrassment do motivate helping efforts (Shott 1979). However, 
the full range of Shott’s and Scheff’s propositions require testing. Some 
studies (e.g. Stotland et al 1978) suggest that the relationship between 
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empathy and helping may be curvilinear; people experiencing very low and 
very high levels of empathy may be less willing to help. (High levels of 
empathy with victims may be so painful that individuals prefer to avoid 
victims rather than assist them.) Because role-taking emotions seem crucial in 
explaining the phenomena of social control and prosocial behavior, they 
deserve serious empirical attention. 


Emotions as Mediators 


Other social psychological approaches treat emotions as intervening variables 
in broader social processes. Emotions have been used to link macro factors to 
micro factors, micro to macro factors, or both. Kemper’s (1978) and Heise’s 
(Smith-Lovin and Heise 1988) theories make such links, as discussed earlier. 
Additional approaches are explored here. 

Scheff’s (1979) theory of catharsis moves from macro to micro. Scheff 
observes that modern society strongly inhibits the release of negative emo- 
tions (the reasons for these inhibitions are left unspecified). Adopting a 
psychodynamic perspective, Scheff assumes that the discharge of distressing 
emotions (grief, fear, anger, embarrassment, boredom) is a biological neces- 
sity. Because individuals learn to repress distressing emotions, bodily ten- 
sions accumulate and cause rigid or neurotic behavior patterns (emotional 
inexpressivity, cognitive blocks, lack of empathy and cooperativeness). For- 
tunately, societies also provide means for the “safe” discharge of collectively 
held, distressing emotions, through ritual, drama, and contests. To the extent 
that these rituals provide optimal “aesthetic distance” from distress, in- 
dividuals are able to discharge repressed emotions and restore physical and 
emotional well-being. Aesthetic or optimal distance consists of a balance 
between participating in and observing one’s own distress; one shifts attention 
back and forth between the distressing event (recalled from the past) and one’s 
safe position as an observer of the event (as audience in the present). Scheff 
defines optimal distance tautologically in terms of successful discharge 
(Scheff 1979: 67). Following numerous critiques (see Current Anthropology, 
1977, 18:490-504), Scheff began studying micromomentary changes in 
facial expression during the recall of past distressing events, to document 
alternations from emotional expression (i.e. presumed discharge) to emotion- 
al distance and back, to the final state of bodily relaxation (Scheff 1983). 

This choice of research strategy has an unfortunate consequence. Attention 
is focused on minute changes in. individuals’ expressive gestures and away 
from the broader sociological implications of the theory. As originally pro- 
posed (Scheff 1977), culturally universal negative experiences and collective 
emotion-management techniques (rituals, drama) were linked to individual 
behavior through a theory of emotion. These broader sociocultural phe- 
nomena, particularly the concept of collective emotion-management, deserve 
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elaboration. Why societies control the discharge of negative emotions, why 
some societies do so more than others, and whether there is a relationship 
between societal emotional regulation and the existence of collective dis- 
charge mechanisms are questions worthy of investigation. 

Collins’ (1981) theory of interaction ritual chains moves from microlevel 
processes to macro ones. Collins argues that macro phenomena (e.g. in- 
stitutions, organizations, social change) are simply the aggregate of micro 
interactions among actors; thus, one may study micro events to understand 
macro phenomena. The key mechanism guiding and motivating micro actions 
is emotion, specifically, emotional group identification and emotional con- 
fidence-—together termed “emotional energies.” Emotional energies are 
obtained and maintained in “interaction ritual chains.” 

Collins argues that conversations invoke shared realities, or myths; thus, 
conversations are simply chains of interaction rituals (IRs) which maintain 
shared myths. Participation requires cultural resources (acquiring the specific 
conversational styles and topics that symbolize and sustain group member- 
ship) and emotional energies (acquiring positive sentiments toward and from 
other group members). These produce another interaction resource, one’s 
social reputation (Collins 1987). Dominant individuals provide the con- 
versational leadership (i.e. they lead in invoking reality), generate emotional 
identification (and obtain emotional confidence), and are the topic of others’ 
conversations (their reputations are enhanced). 

Macrostructural realities of property, authority, and organization are cre- 
ated in the “particularized” content of conversation. Talk identifies people by 
title or organizational membership (paraphrasing from Collins: “the chairper- 
son,” “her husband,” “he works for company x”) and talk implicitly gives 
certain individuals the right to and the reputation for holding property and/or 
authority (“I went into her office,” “He wrote a memo telling us to do x”). 
“Particular individuals enact [and thus perpetuate] the property and authority 
structure because their previous IR chains give them certain emotional ener- 
gies and cultural resources, including the resource of the reputation for 
belonging in certain authority rituals and particular physical places” (Collins 
1981: 1004). Individuals who acquire large amounts of emotional energy (i.e. 
emotional attachments to others, emotional confidence) in the “marketplace” 
of ritual encounters will be able to instigate local, if not broader, structural 
changes through their verbal and nonverbal claims to property and/or author- 
ity. If they can marshall the emotional energies of others and manage an 
impression of a positive reputation (e.g. through the media), the market 
attractiveness of their coalition will be enhanced. To understand macro 
stability and macro change, then, one need only sample and audiovisually 
record conversations and behaviors across a large number of different social 
groups over time (Collins 1981: 1011). Recordings can be analyzed in detail 
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for indicators of the cultural and emotional resources attributed and claimed in 
ritual interactions. 

Like Kemper and Heise, Collins emphasizes affective causes of symbolic 
interaction and of observable behaviors. In all three theories, affective dy- 
namics mediate the relationship between broader culture and specific social 
structures (or patterned behaviors). However, Collins’ goal—accounting for 
macro structure and macro change—is broader than that of Kemper or Heise, 
and the affective dynamics are different. Stability and change are produced by 
emotional rewards and costs in the interactional marketplace (as well as by 
stability and change in cultural resources). Beyond the debatable assumption 
that macro phenomena are the sum total of micro events and the potential 
tautology in cost/benefit analyses, two methodological problems are evident. 
First, the key theoretical mechanisms, emotional energies, are undefined (and 
their exact roles unclear), so appropriate operationalizations are not obvious. 
Second, sampling and analyzing micro-interactions in order to study larger 
structures seems impractical. Researchers who study facial expressions, body 
language, and linguistic structures complain that five minutes of recorded 
interaction can require literally hundreds of coding hours. Whether the deeper 
understanding of macro factors obtained is worth the considerable investment 
involved in micro analyses is an empirical question at present. 

Note that in both Collins’ and Heise’s approaches, affect pervades interac- 
tion and affective principles or processes govern interaction. Missing from 
both approaches is an insight offered by symbolic interactionists: emotions 
have personal meaning for individuals. As several theorists (including Darwin 
and Freud) have pointed out, emotions serve a signal function, not only to 
others, but to the self (Hochschild 1983, Gordon 1989a, Heise 1979, Denzin 
1984). Denzin (1984) perhaps makes this point most eloquently in describing 
emotion phenomenologically as “self-feeling.” “An emotional experience that 
does not in some way have the self, the self-system, or the self or self-system 
of the other as its referent seems inconceivable. . . . It is through emotional- 
ity, imagination, sympathy, fellow-feeling, and revealed self-feelings that 
persons come to know themselves and one another” (Danzin 1984: 50, 245). 
From a symbolic interactionist perspective, the self motivates behavior. 
Emotion becomes a meaningful object to be interpreted, controlled, used, or 
managed by social actors, who are engaged in understanding themselves and 
managing others’ impressions of them. Recently, Stryker (1987) has at- 
tempted to integrate this insight into identity theory-—his formal symbolic 
interactionist theory of society, self, and behavior. 

Stryker conceptualizes the self as consisting of multiple identities, which 
are hierarchically ordered. Identities refer to internalized social roles (e.g. “I 
am a husband, father, professor, tennis player . . .”). Stryker (1987) notes 
that, like social structure itself, affect influences the probability that role 
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relationships will form. If affective meanings are shared with others, rela- 
tionships should be affectively positive and intense and thus highly salient. 
People are more likely to enact identities to which intense positive affect is 
attached and will attempt to deny identities associated with intense negative 
affect. Identity-enactments in turn should influence feelings. Successful 
identity performances generate positive affect (e.g. self-esteem or pride in 
self, respect or liking in others); inadequate performances produce negative 
emotions (e.g. embarrassment or shame in self, disappointment or anger in 
others). Because identity-affirmations or -denials require confirming be- 
haviors from role partners, emotional “outbursts” can be expected when role 
partners refuse to sustain a claimed identity. Emotions thus serve as signals to 
the self regarding the quality and acceptability of one’s identity claims and 
performances, and emotions can lead to changes in role behavior, network 
memberships, and ultimately, social structure. Strictly speaking, emotions are 
not intervening variables in this formulation; they are treated primarily as 
conditioning or moderating variables. 

Related ideas can be found in Hochschild (1979, 1983), Thoits (1985), and 
Gordon (1989a). Hochschild (1983, 1989b) has shown that individuals work 
very hard to transform unconventional feelings or displays into normative 
ones to allay guilt or sustain a positive self-image. I have suggested that 
persistent emotional deviance leads individuals to redefine themselves as 
emotionally disturbed and to seek help (Thoits 1985). Gordon (1989a), 
however, qualifies the implicit assumption that individuals always are dis- 
tressed by their nonnormative emotional experiences. He shows that in- 
dividuals who hold “institutional” orientations to emotion infer their “true” or 
“real” selves from feelings that are consistent with normative expectations, 
and infer “spurious” or “inauthentic” selves from their unconventional feel- 
ings or displays. In contrast, individuals holding “impulsive” beliefs about 
emotion infer true selves from unconventional feelings and false selves from 
conventional feelings. Thus, what emotions signal to the self (and, by exten- 
sion, signal to others) depends on the emotional beliefs that one has acquired. 
Note that, again, emotional states and subcultural orientations to emotion 
serve as conditions under which other self-relevant processes will occur. 

Symbolic interactionist theorists have acknowledged the importance of 
receivers’ interpretations and subsequent reactions to senders’ emotional 
signals but so far have not studied these nor their subsequent consequences in 
any detail (but see Hardesty 1986, Flaherty 1984). How do quarrels escalate? 
How do emotional support-giving attempts fail? How do service-workers 
successfully manipulate clients’ emotions? The reciprocal processes of per- 
ceiving, decoding, and responding to one another’s emotional signals deserve 
attention (see Buck 1984), as does variability in these processes by age, 
race/ethnicity, sex, and social class. Emotions are signals to the self, but also 
they are signals to others and the objects of others’ responses. 
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A number of formal emotion theories discussed earlier (e.g. Kemper 1978, 
Heise 1979, Collins 1981, Scheff 1979) treat affects as mechanical motiva- 
tors: positive affect leads to some types of behavior, negative affect to others, 
Incorporting the meaning of emotions for the self (and others) into theory 
helps specify when (and what) actions will follow from feelings. Emotions 
may provide impetus for action, but concerns about impression management 
and self-esteem direct that action. Because the social self is both a key source 
of motivation and a conceptual link between social structure and individual 
behavior, it may be unnecessary to posit close ties between specific emotions 
and subsequent actions, as Kemper (1978) does, for example. Emotions 
become objects to be interpreted, transformed,- and regulated through the 
social self as an active agent, rather than serving only as motivators. Thus, the 
symbolic interactionist perspective, which treats emotions as meaningful 
objects to social selves, can be viewed as the “third face” of social psycholog- 
ical approaches to emotion. 

In addition to their mediating role in more abstract theories of social 
structure and behavior, emotion concepts increasingly are being incorporated 
as intervening variables in substantive work. A number of examples can be 
cited. Wasielewski (1985) has reanalyzed charisma, arguing that it is not a 
personality characteristic, but springs from leaders’ skillful, deliberate manip- 
ulations of feeling and display rules, evident in the content of speeches and 
audiovisual recordings. Hochschild (1989a, 1989b) has examined how gen- 
der-role ideologies (traditional, transitional, and egalitarian ideologies) 
regarding the household division of labor are translated into strategic lines of 
action in two-job couples. Women and men employ different emotion man- 
agement strategies both to prepare for conflict over housework and to control 
feelings of anger, frustration, or guilt produced by unsuccessful negotiations. 
I have used emotions theory to derive new typologies of coping and social 
support responses to stress, and to identify conditions under which persistent 
emotional deviance will lead to self-attributions of psychological disturbance 
(Thoits 1984, 1985, 1986; see also Pugliesi 1987). Mills & Kleinman (1988) 
typologize battered women’s reactions to violence by analyzing the various 
relationships possible between cognitions and emotions. Ridgeway & John- 
son (1988) have developed hypotheses about affective interchanges to account 
for unexplained characteristics of small group interaction. Markovsky & 
Lawler (1988) explicate the role of emotional attachment in establishing 
group solidarity and explaining solidarity’s consequences. Sprecher (1986) 
examines specific emotions resulting from types of inequity in intimate 
relationships and, by implication, the effects of these feelings on overall 
relationship satisfaction and/or longevity. 

Although other examples might be cited, the point here should be clear. By 
“bringing emotions back in,” researchers in fields as diverse as social move- 
ments, gender roles, marriage and family, stress and mental health, deviance, 
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small group dynamics, and interpersonal relationships may further specify or 
explain important substantive phenomena. 


MACRO-LEVEL APPROACHES TO EMOTION 


In contrast to micro-level analyses, macro approaches usually treat emotions 
as dependent variables. There is one exception of which I am aware: Ham- 
mond (1983) views human “affective capacities” as socially causal (described 
below). Typically, the intensity of particular emotional experiences in the 
population (e.g. Lofland 1985) or changes in emotion culture (Stearns & 
Stearns 1986, Cancian 1987, Swidler 1980) are examined as outcomes of 
large-scale features of social life. Macro theorists tend focus on particular 
emotions, such as grief, anger, and love. 


The Shaping of Emotional Experience and Emotion Culture 


Lofland (1985) discusses sources of historical and cross-cultural variability in 
the experience of grief. She identifies four sociocultural factors which should 
affect the intensity and duration of grief in a society: (a) how much particular 
relationships are invested with significance, (b) the mortality rates of the 
group, (c) how much feelings are controlled or given free play, and (d) how 
much individuals are physically isolated from others in time and space, and 
thus able to focus on their feelings. Because these sociocultural factors vary 
over time and across societies, the private experience of grief likely varies as 
well (see also Aries 1982). For example, the higher the infant mortality rate, 
and thus the higher the risk of loss, the lower the emotional investment in 
children and the shorter and less intense the grief response to child deaths. In 
essence, Lofland suggests that one might study modal grief experiences as a 
function of demographic factors (among other sociocultural factors). 

The difficulty, as Lofland points out, is lack of data on the dependent 
variable, especially historically: Clinical observations, interview studies, and 
first-person accounts are available only in recent history and Western cul- 
tures. (Data on mourning practices are not relevant, as it is what people feel 
rather than what they do in response to death that is crucial to Lofland.) 
Nevertheless, her argument might be tested provisionally by the strategic 
selection and comparison of contemporary subcultures which vary on at least 
some of the sociocultural dimensions she has identified (see also Lofland 
1982). 

Stearns & Stearns (1986) have studied historical changes in American 
anger from the colonial period to the present. Because aggregate data on 
subjective anger experiences were lacking, they concentrated on “emotionolo- 
gy” or “emotion culture”-—conventions and standards regarding the experi- 
ence and expression of anger. As indicators, they utilized recurrent themes in 
popular magazines, child-rearing manuals, marriage manuals, and other 
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documents intended to teach sectors of the American public about anger and 
appropriate anger control. 

Drawing from Lasch (1977), Stearns & Stearns (1986) argue that in- 
dustrialization and new market conditions in the mid-1800s caused the home 
to be reconceptualized as an emotional haven (for men) from a heartless, 
competitive world. Consequently, anger control (in women) became crucial 
for harmonious marriage and family life (see also Cancian & Gordon 1989). 
Then, early in the 1900s, rising rates of labor unrest and white collar and 
female employment prompted growing concern about workers’ emotions. 
Maintaining morale and avoiding anger became labor relations goals thereaf- 
ter. These workplace norms subsequently affected the family, in the form of 
child-rearing advice against anger. Anger has been and remains the “central 
emotional enemy” for contemporary Americans, Stearns & Stearns conclude 
(1986:211). More recently, Stearns (1988) has examined historical increases 
in American concerns about sibling jealousy, arguing that changes in the 
affective intensity of family life (due to smaller size and permissive rearing 
practices) actually created more jealousy and thus reason for worry. 

Swidler (1980) also has examined Western emotional ideology, but in a 
more discursive way. She cbserves that traditional love myths once helped 
resolve contradictory cultural values, bridging the gap between the tasks of 
youth (self-realization, autonomy) and adulthood (commitment, self- 
sacrifice). Current love ideology emphasizes one side of the myth—self- 
actualization, growth, autonomy, and sexual expression, while deemphasiz- 
ing lifelong commitment, attachment, self-sacrifice, and sexual restraint. 
(The crucial unanswered question raised by these observations is, what has 
produced this shift in ideological emphasis?) Swidler (1985) later argues that 
culture provides a “tool kit” of values, traditions, rituals, and beliefs for use in 
constructing strategic lines of action, especially during unsettled periods. In 
other words, cultural values are the means by which behaviors are con- 
structed, rather than ends in themselves. Applied to Swidler’s previous 
observations, contradictions and complexities in love ideology can be used by 
individuals to justify very different strategic lines of action in their intimate 
relationships (see Hochschild 1989b for empirical examples). The key re- 
search question then becomes: Under what historical and structural conditions 
will some strategic lines of action, and thus some aspects of love ideology, 
become dominant? Swidler thus reframes the problem that a sociology of 
emotions might address.’ 


7A similar argument can be found in Elias (1978). In his monumental two-volume work, Elias 
traces the history of manners, which are rooted in feelings of disgust, shame, and embarrassment 
about human bodily functions. He connects the process of civilization (including changes’ in 
disgust and shame thresholds) to the rise of the modern state, and argues that hygienic reasons for 
certain practices served only as after-the-fact rationalizations for customs originating in structural 
change. 
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Cancian (1987) offers some answers to the problems raised by Swidler. She 
argues that the conflict between traditional love beliefs and the current 
emphasis on self-development is linked to an earlier polarization in gender 
roles. Modern American concepts of love (as dependency, tenderness, ex- 
pressiveness, self-sacrifice) are feminized, while the notion of self- 
development (as autonomy, independence, self-reliance) is masculinized. 
This polarization in images, Cancian argues, was produced by the shift from 
an agrarian to capitalist economy and the resulting gender division of labor 
(and economic power) during the 1800s. Women became responsible for the 
home sphere and for emotions; men for the work sphere and for achievement. 
During the 1900s, a number of trends (consumer advertising, lower birth 
rates, higher divorce rates, women in the labor force) placed more emphasis 
on self-fulfillment for both women and men. But cultural lags in gender-based 
images of love create serious conflicts in modern relationships. Cancian 
proposes that the current trend toward extreme independence and individual- 
ism that worries social critics (e.g. Bellah et al 1985) is only one possible 
outcome of these conflicts; the emerging androgynous image of love contains 
a potential resolution—interdependence—where partners are equally respon- 
sible for sustaining a commitment. Economic shifts (from free market compe- 
tition toward socialism) and changes in the masculine role (from emphasis on 
individual achievement toward cooperative achievement) may foster in- 
terdependence and new cultural forms of love. 

Overall, these descriptive analyses suggest that changes in emotion culture 
follow from certain large-scale institutional changes. Unfortunately, the rela- 
tionships between antecedent institutional changes and subsequent cultural 
changes have not been documented empirically in this literature. A structural- 
functional approach to emotion culture seems implicit in much of this work 
(e.g. Stearns & Stearns 1986, Cancian 1987)—i.e. emotion norms are pro- 
duced by and function to sustain dominant institutional arrangements. Even 
more implicit are assumptions about the particular emotions that result from 
structural arrangements, emotions that then become encouraged or dis- 
couraged by norms (e.g. industrialization and capitalist competition produce 
anger in workers, which then becomes the object of normative regulation at 
work and at home). If underlying assumptions were made explicit, macro- 
level research might move more quickly from description and speculation to . 
theory development and testing. 

Macro-level efforts to trace the historical, structural, and ideological an- 
tecedents of particular emotional beliefs are important for three reasons. First, 
they generate hypotheses that can be tested more systematically (e.g. see 
Cancian & Gordon 1989). For example, if within one country anger norms 
vary historically with changes in economic structure, then cross-cultural 
comparisons of the relationship between anger norms and economic structures 
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seem warranted. Second, macro-level research may specify the conditions 
under which micro theories are applicable. For example, concern with power 
and status (Kemper 1978) may differ systematically across societies or within 
societies through time. Finally, as a byproduct, the macro analysis of particu- 
lar emotional beliefs may help to disentangle the complex interplay of culture 
and social structure (Swidler 1985), and to identify the origins and functions 
of emotion norms. 


Affects as Organizing Principles 


One recent theoretical approach has reversed the usual causal direction of 
macro inquiry. Hammond (1983, 1988) has treated macro structure as the 
product of human affective capacities. Elaborating Durkheim, Hammond 
argues that social differentiation and social stratification are shaped by several 
“affective principles.” He assumes that social bonds are based on intense, 
positive affective arousal and that people seek arrangements which maximize 
this arousal (the principle of “affective maximization”). However, human 
affective resources are limited; prolonged intense arousal is physiologically 
debilitating, so humans must limit the number of others to whom they are 
affectively attached. Some individuals become the focus of intense arousal, 
and others not (the principle of “uneven attachment’). The greater the density 
of a population, the greater the likelihood that individuals will differentiate 
among others using observable characteristics (e.g. sex, age, race, lan- 
guage—classifications requiring little cognitive effort) and affectively stratify 
these differences (the principle of “social differentiation”). Thus, social in- 
equalities are produced and sustained by innate affective motives and affec- 
tive limitations, and vary positively with population density. Hammond 
(1988) explicitly contrasts the implications of affective maximization theory 
(which holds that virtually anyone or any group can become an object of 
affective attachment) with sociobiological predictions (which hold that innate 
tendencies favor kin selection). If reliable indicators of Hammond’s key 
concepts can be found (a potentially difficult task), the theory may be 
valuable in challenging sociobiological theory to a series of critical tests. 


DISCUSSION AND CONCLUSIONS 


In this review I have attempted to highlight recent works that have rich 
implications for hypothesis-generation or that suggest problems for further 
empirical pursuit. I also have pointed to important gaps in sociological 
knowledge about emotions. At present, the field appears to be headed simul- 
taneously in two major directions: the further understanding of emotions and 
emotional processes as sociological phenomena in their own right, and the 
application of emotion concepts and processes to substantive problems in 
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areas such as deviance and social control, marriage and family, gender roles, 
social movements, small group dynamics, stress and mental health, in- 
stitutions, stratification, and the sociology of culture. 

Clearly, the field has moved beyond the infancy stage of concept and 
perspective development into an early childhood of theory elaboration and 
exploratory research. Work is more often theoretical than empirical in nature 
at the micro level. In contrast, work is more often empirically descriptive and 
speculative than theoretical at the macro level. Additional systematic theory 
development at the macro level seems especially necessary. More descriptive 
research at both the micro and macro levels of analysis still is needed. 

Qualitative and/or descriptive research at the micro level might address 
serious gaps in our knowledge regarding the “epidemiology” of various 
emotions, emotional socialization processes (for both children and adults), the 
significance of role-taking emotions for social control and prosocial behavior, 
and processes of emotional interaction. Both micro and macro researchers 
might examine the content of emotion culture (or of particular emotion 
subcultures) in more detail. At the macro level, historical and cross-cultural 
relationships between structural arrangements and emotion norms need fur- 
ther documentation and elaboration. More use of comparative strategies at 
both micro and macro levels of research clearly is needed. Some theoretical 
efforts at the micro level (e.g. Kemper 1978, Collins 1981, Heise 1988, 
Hochschild 1983, Lofland 1985) have been elaborated sufficiently to warrant 
serious hypothesis-testing; in most cases, testing has yet to be undertaken and 
is the crucial next step. 

The need for more descriptive research and systematic hypothesis testing 
raises problems of measurement and methods, which I have side-stepped in 
this review because empirical efforts are so few, as yet. Sociologists of 
emotion have yet to confront measurement issues squarely. 

Emotion indicators and problems of inference obviously depend on the 
investigator’s level of analysis. At the macro level, two types of indicators are 
common: (a) frequencies, intensities, or durations of various emotional expe- 
riences, as self-reported in surveys or qualitative materials, and (b) content 
codes for various emotional beliefs, obtained from documents, records, 
ethnographies, and media such as textbooks, advice books, manuals, maga- 
zine articles, newspaper columns, novels, films, and the like. When emotion- 
al ideology and norms are the focus, the relationship between cultural beliefs 
and actual experience becomes especially problematic. One can be tempted to 
generalize from the content of emotion culture to modal emotional experi- 
ences in the population. Potential discrepancies (and cultural lags) between 
public standards and private experience must be acknowledged by research- 
ers, and, where possible, assessed with recourse to alternative indicators, 
such as personal interviews, diaries, letters, and trends in divorce rates, strike 
activities, and so on (Stearns 1985). 
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At the micro level, investigators have analyzed subjective self-reports 
(including personal introspection—see Ellis 1987), changes in facial expres- 
sion, and physiological changes (the latter two more often done by psycholo- 
gists) as indicators of emotional experience. Rarely do studies employ more 
than one type of indicator. The choice of measures is crucial for the validity of 
investigators’ inferences. Because feeling and display norms are influential, 
self-reports and observable behaviors are suspect; respondents’ impression 
management concerns may cause verbal dissimulation and behavioral mask- 
ing. Physiological indicators are highly complex and considerable disagree- 
ment still exists regarding whether they are reliably associated with particular 
emotions in a one-to-one fashion. Consequently, using multiple indicators 
and exercising considerable care to reduce social desirability bias seem 
warranted at the micro level. . 
Methodological problems are likely to loom large as more empirical studies 
and conflicting findings accumulate in the next stage of the field’s develop- 
ment. These problems probably will not be insurmountable but must be 
addressed explicitly, because the continued legitimacy, utility, and growth of 


this field will depend next on careful empirical work. 
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Abstract 


This paper reviews the current state of knowledge on sex differences in 
earnings in the United States. The paper has three sections. The first describes 
the phenomenon under consideration, reviewing what is known about the size 
of the wage gap, historical and life course variations in the wage gap, and race 
differences in the wage gap. The second section, which constitutes most of 
the paper, reviews explanatory theories advanced to account for the wage gap 
and the empirical evidence relevant to their evaluation. This section is divided 
into two principle parts. The first considers “supply-side” explanations that 
focus on the characteristics and decisions of individual workers. These in- 
clude the human capital theory of economics and alternative views offered by 
sociologists and social psychologists that focus on processes of socialization 
and allocation and the operation of social networks. All of these explanations 
attribute the sex gap in earnings to differences in the qualifications, in- 
tentions, and attitudes that women and men bring to the labor market, 
including the social ties that influence worker-job matches. The second part of 
the section considers “demand side” explanations that focus on characteristics 
of the workplace and actors within it. These explanations include theories of 
discrimination in the labor market developed primarily by economists and 
social psychologists and ideas about the evolution and persistence of a 
discriminatory wage structure put forward by institutional economists and 
sociologists. The final section suggests directions for future work. 
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DESCRIPTION OF THE PHENOMENON 
The Current Wage Gap 


Women and men earn at markedly different rates in the United States. Current 
estimates point to an hourly wage rate for women aged 16 and over that 
approaches only about 70% of the wage rate of men. Precise estimates of the 
gap vary with many factors: the precision in estimating hourly wage rates, 
whether wages from more than one job are considered, the inclusion of 
self-employment income, the age range over which estimates are computed, 
and the extent to which reporting error is minimized. The most recent 
estimate, based on data from the Survey of Income and Program Participation 
for wage and salary workers aged 21 to 64, indicates that women’s hourly 
earnings were 70% of those of men in 1986, an increase from 68% in 1984 
(Pear 1987; US Bureau of the Census 1987). These figures indicate a some- 
what smaller wage gap than do the more commonly reported data on annual 
earnings for full-time, year-round workers aged 16 and older, available from 
the Current Population Survey. The latter put the wage gap at 64% in 1983 
and 65% in 1985 (Bianchi & Spain 1986; Blau & Ferber 1986; US Bureau of 
the Census 1986). After adjusting for the sex difference in number of hours 
worked per week, these figures also increase. In 1982, the unadjusted figure 
was 62%, but it increased to 67% after adjustment (O’Neill 1985). 

Data on the usual weekly earnings of full-time wage and salary workers, 
also available from the Current Population Survey, indicate a somewhat 
smaller wage gap than do data on the annual earnings for full-time, year- 
round workers. In 1983, women’s usual weekly earnings were 66% of those 
of men, prior to adjustment for the sex difference in hours worked, and 72% 
after adjustment (O’ Neill 1985; Bianchi & Spain 1986; Blau & Ferber 1986). 
The estimate based on usual weekly earnings is somewhat smaller than that 
based on annual earnings because usual weekly earnings are reported only for 
an individual’s primary job, do not include all self-employment income, and 
include the income of individuals who worked only part of the year. 


Historical Trends 


The sex difference in earnings has changed little during the twentieth century 
despite major increases in, women’s labor force participation. After an in- 
crease in women’s wages relative to those of men during the early part of the 
century, the ratio of female-to-male wages changed little between 1920 and 
1980 (Goldin 1983; Smith & Ward 1984). In 1920, when about 20% of 
women were in the labor force, the ratio of female-to-male wages among 
full-time workers was estimated to be 63% (Smith & Ward 1984); in 1980, 
when well over half of women were in the labor force, the wage ratio was 
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60%. Over this 60-year time span, women’s relative wages drifted slowly 
downward until the mid-1970s, reaching a low of 59%. Not until after 1980 
did women’s relative wages show a significant increase. In 1983, women’s 
earnings reached 64% of men’s earnings, a level they previously reached in 
1955 (Smith & Ward 1984; Bianchi & Spain 1986; Blau & Ferber 1986). in 
1985, women’s earnings were 65% of men’s earnings. Data on usual weekly 
earnings, available only since 1967, show a similar pattern of stability until 
1980, when relative female wages began to increase (O’Neill 1985; Bianchi & 
Spain 1986; Blau & Ferber 1986). Thus, after a long period of stability, the 
female-to-male wage ratio has shown slight increases since 1980 and is now 
just a few points above its level in 1920. 


Life-Course Variation 


‘Women’s relative earnings are not constant over the life course but decline 
with age. Women earn less than men at all ages, but women’s earnings 
decline as a percentage of men’s earnings until about age 40 when the 
earnings ratio levels off. In 1983, the annual earnings of full-time women 
workers aged 25-34 were 73% of those of men, whereas women aged 45 to 
54 earned only 56% of what men earned (Blau & Ferber 1986). Although the 
earnings position of women in all age groups deteriorated during the middle 
part of the century, younger women began to experience increases in relative 
earnings while those of older women were still declining (Smith & Ward 
1984; Fuchs 1986a). The relative earnings of women under 35 began to 
increase during the 1970s, and it is the earnings of women in this age group 
that have shown particularly substantial increases since 1980 (Smith & Ward 
1984; O’Neill 1985; Fuchs 1986b; Bianchi & Spain 1986). Whether these 
younger cohorts of women will experience less deterioration in their relative 
earnings as they age remains to be seen, but analysis of age-related differ- 
ences in relative earnings within cohorts suggests that at least some of the 
improvement in the relative economic position of younger women will be 
retained (Bianchi & Spain 1986; Blau & Ferber 1986). 


Race Differences 


Although there is a large sex difference in earnings in all racial groups, the 
wage gap is somewhat larger among whites than among blacks and Hispanics. 
In 1982, data on the annual earnings of full-time, year-round workers in- 
dicated that the female-to-male earnings ratio was 62% for whites, 75% for 
blacks, and 73% for Hispanics (Bianchi & Spain 1986; Blau & Ferber 1986). 
For whites, as for the population as a whole, the wage gap narrowed very little 
until the early 1980s, and women workers actually lost ground relative to 
male workers between 1955 and 1982. For blacks, however, the wage gap 
narrowed progressively over this period. In 1955, the earnings of black 
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women relative to black men were 55%, but they increased to 75% by 1982. 
This gain occurred during a period when the earnings of black men were also 
rising relative to the earnings of white men (Smith 1979). Although data on 
the earnings of Hispanics are available only since the mid-1970s, a moderate 
increase in the relative earnings of Hispanic women has occurred since then. 
Blacks and Hispanics of both sexes earned less than whites in 1982, but race 
differences in earnings were considerably smaller among women than men. 
The earnings of black and Hispanic women were 91% and 82%, respectively, 
of those of white women. Comparable figures for males were 75% and 70%. 
Race differences in earnings decreased substantially for both sexes prior to 
1975 but have increased somewhat since then (Blau & Ferber 1986). 


EXPLANATORY THEORIES AND EVIDENCE 


Attempts to explain sex differences in earnings have been based primarily on 
economic theories of wage determination. Underlying these theories is the 
idea that wage determination can be treated as a special case of the determina- 
tion of commodity prices (Hicks 1968). The wage is hypothesized to equal the 
value added to the product by hiring the last unit of labor, or the marginal 
product of labor. Although it has sometimes been incorrectly assumed that the 
marginal product theory of wages implies that workers are paid on the basis of 
personal characteristics that affect productivity, the marginal product of labor 
is also affected by characteristics of the job and employer, demand for the 
product, and the supply schedule of labor (Granovetter 1981). 

Because the marginal product theory makes the simplifying assumption that 
labor is infinitely divisible into homogeneous units, as commodities were 
assumed to be, it can explain differences in wages between workers only by 
imagining the existence of some minimal unit of labor and assuming that 
workers vary in the number of units presented. This raises the question of 
what explains differences in the number of units. It was the inability of 
neoclassical theory to account for wage differences among workers that led 
first to the dominance of labor economics by “institutionalists” who diverged 
markedly from the neoclassical assumptions during the 1940s and 1950s (e.g. 
Reynolds 1951; Dunlop 1957), and then to the development of the fully 
neoclassical theory of human capital by Schultz (1961, 1963), Mincer (1958, 
1974), and Becker (1964). Although human capital theory dominates current 
research by economists, it was originally a response to treating the determina- 
tion of wages by marginal product as an exclusively demand-side issue, with 
productivity determined by capital investment and the technical relations of 
production. 

In sociology, treatment of the problem of wage determination within the 
“status attainment” tradition emphasized the effects of personal characteris- 
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tics, including family background, on earnings. Over the past decade, howev- 
er, sociological attention has shifted increasingly to consideration of the 
effects of labor market characteristics. Within both economics and sociology, 
therefore, wage differences among workers are recognized to be a function of 
the characteristics of workers and of extra-individual factors, including the 
characteristics of jobs and employers, which affect the prices for the firm’s 
output and the costs of other factors of production. Economists and sociolo- 
gists tend to differ on the extent to which they view the behavior of in- 
dividuals and firms as influenced by institutionalized rules and practices and, 
therefore, in the degree that the processes they are concerned with are molar. 
Economic theories of the determination of wages focus primarily on in- 
fluences in the present that impinge directly on wage rates, although theories 
based on the notion of the, career life cycle, which consider the problem of 
wage determination within the context of the adult life course, are an impor- 
tant exception. The theories of sociologists (and institutional economists), by 
comparison, focus more on the ways in which institutional arrangements (in 
families, schools, and firms) perpetuate the effects of influences at one point 
in time (in either the life course of an individual :or the historical progression 
of a society) on events at a later point in time. Thus, sociologists have been 
more concerned with processes of intergenerational transmission and with the 
historical persistence of wage differentials among occupations or job clusters 
and industries. 

The different emphases of economists and sociologists, coupled with prob- 
lems of empirical operationalization, have produced some confusion in at- 
tempts to explain sex differences-in earnings in the United States. Greatest 
interest and controversy have centered on the extent to which the wage gap 
can be attributed to voluntary choices by workers vs. discrimination in the 
labor market. Economists have tended to see sex differences in the pro- 
.ductivity-related characteristics of workers as a result of voluntary investment 
in market skills, whereas sociologists see such sex differences as emerging 
from a process of socialization that transmits the effects of past discrimination 
to new generations. Both disciplines acknowledge that some worker charac- 
teristics, such as the accumulation of labor market experience, can be affected 
directly by discrimination. There is also some awareness that associations 
between sex differences in worker characteristics and sex differences in 
earnings may be affected by other social processes that match workers to jobs, 
such as network ties and access to information. Thus, even sex differences in 
earnings associated with sex differences in worker characteristics cannot be 
ascribed neatly to voluntary choices by workers. 

Economists and sociologists have tended to attribute to discrimination the 
sex differences in earnings not associated with the productivity-related char- 
acteristics of workers. This approach has changed some in recent years with 
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the awareness that the labor market is highly sex segregated and that this 
segregation bears a strong relationship to sex differences in earnings. 
Although both economists and sociologists now attempt to take an in- 
dividual’s position in the labor market into consideration, sex differences in 
the structural location of employment cannot be ascribed neatly to an effect of 
discrimination, particularly if one wishes to separate the effect of past dis- 
crimination on the formation of worker preferences and expectations from the 
direct experience of discrimination in the labor market. Even if the goal is to 
identify the direct experience of discrimination in a particular cohort, sex 
differences in job placement may result from sex differences in worker 
preferences, discrimination, and other social processes that affect the sorting 
of individuals into jobs. In short, although sex differences in earnings are 
recognized to bear a relationship to sex differences in both worker characteris- 
tics and job placement, attempts to use knowledge of these associations to 
make inferences about the mechanisms producing sex differences in earnings 
have been problematic. Part of the problem is that sex is not only a basis for 
the contemporary allocation of workers to jobs. It has also played an impor- 
tant historical role in the typing of jobs as appropriate for some individuals 
and not others and in the development of an associated wage structure that 
appears to persist after the conditions that gave rise to it have changed. Thus, 
although the deliberate practice of discrimination may have declined in recent 
decades, the pervasive discrimination practiced at earlier times remains em- 
bedded in the wage structure and continues to produce sex differences in 
earnings via occupational socialization, the formalization of personnel pro- 
cedures within organizations, and other social processes that restrict informa- 
tion and opportunity in the matching of workers to jobs. 

Because of the way in which explanatory theories and evidence pertaining 
to the sex gap in earnings have developed, the. discussion that follows is 
divided into two sections. The first focuses on supply-side influences (or. 
worker characteristics) and the second on demand-side influences (or charac- 
teristics of the workplace and actors within it). As noted above, it is problem- 
atic to translate this distinction into one between the voluntary choices of 
workers and labor market discrimination. 


Supply-Side Influences 


Supply-side explanations of the sex gap in earnings have been developed 
within both the human capital theory of economics and the status attainment 
tradition of sociology. These explanations focus on the characteristics and 
decisions of individual workers, attributing the sex gap in earnings to differ- 
ences in the qualifications, intentions, and attitudes that women and men 
bring to the labor market. Since most research on the effects of worker 
characteristics on earnings has been done by economists, most research has 
been based on the human capital model. 
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HUMAN CAPITAL THEORY In human capital theory, workers are seen as 
rational actors who make investments in their productive capacities to 
maximize lifetime income (Becker 1964; Mincer 1974). The investments 
usually studied are education and on-the-job training, although the theory 
applies to other investments, such as effort, job search, geographic mobility, 
and health. Earnings differences among workers arise because workers make 
different initial and continuing investments that affect their productivity. 
Attempts to explain the sex gap in earnings within the human capital model 
have focused primarily on sex differences in education, on-the-job training 
derived through labor market experience, and effort. 


Education The median years of schooling attained by women workers are 
about the same as those attained by men, but other differences in the educa- 
tional distributions of the two sexes affect the sex gap in earnings (Blau & 
Ferber 1986). Women are more likely than men to complete high school, but 
men are more likely to complete four or more years of college. Although 
women now earn about half of the Bachelor’s and Master’s degrees awarded, 
they are less likely than men to earn doctorates and first professional degrees 
(Blau & Ferber 1986; US Bureau of Census 1986). At the level of secondary 
and higher education, the sexes also take different courses and specialize in 
different fields. For example, from high school onward males take more 
advanced mathematics and natural science courses than do females (Fox et al 
1979; Marini & Brinton 1984). The distributions of the sexes across voca- 
tionally oriented programs in high school also differ markedly (American 
Institutes for Research 1980), and although sex differences in fields of 
specialization at the college level have narrowed in recent years, they remain 
large (Jacobs 1985; Blau & Ferber 1986; US Bureau of the Census 1986, 
1987). 

Most research on the effects of education on earnings has focused on years 
of schooling. Although a well-documented relationship exists between years 
of schooling and earnings for both sexes, this measure can account for little of 
the current sex gap in earnings because there is virtually no sex difference in 
mean years of schooling (Corcoran & Duncan 1979). Sex differences in 
vocational preparation in high school, which arise from the sex-typing of 
occupations, are likely to have a more important effect on the earnings gap 
because of the strong relationship between occupational sex-typing and earn- 
ings among those without a college degree. Sex differences in fields of 
specialization in college and graduate school also have an important effect on 
the sex gap in earnings among college graduates (Daymont & Andrisani 1984; 
US Bureau of the Census 1987). 


On-the-job training According to human capital theory, individuals can 
increase their productivity not only through formal education but by learning 
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on the job. On-the-job training occurs in formal training programs provided 
by employers or as a result of informal instruction by supervisors and cowork- 
ers and simply by doing the job. Time away from the job, in contrast, can lead 
to skill depreciation. Because individuals are assumed to be able to choose the 
timing and amount of their investments in on-the-job training and to do so to 
maximize their lifetime earnings, it is argued that women who plan to leave 
the labor force may defer on-the-job training and therefore experience slower 
wage growth prior to interruptions than do individuals who do not anticipate 
future interruptions. 

Analysis of the work histories of women and men indicates that, on 
average, women complete fewer years of formal on-the-job training and 
acquire less labor market experience than men. They also spend fewer years 
on the same job and with the same employer, are more likely to work 
part-time, and have more labor force interruptions (Corcoran & Duncan 1979; 
Blau & Ferber 1986; US Bureau of the Census 1987). Studies of the effects of 
these variables on earnings indicate that both formal on-the-job training and 
labor market experience, particularly experience in the same job and with the 
same employer, bear positive relationships to the earnings of both sexes 
(Mincer & Polachek 1974, 1978; Sandell & Shapiro 1978; Corcoran & 
Duncan 1979). These relationships are of the same magnitude for the two 
sexes, although because different types of experience are not equally valuable 
and men spend relatively more time acquiring the more valuable types of 
experience, such as formal on-the-job training, the relationship of overall 
work experience to earnings is greater for men than women. Part-time work 
experience bears little relationship to earnings growth for either sex (Corcoran 
& Duncan 1979; Corcoran et al 1983). One problem with human capital 
theory as an explanation of the sex gap in earnings is that, although it is 
usually assumed that women’s lower investments in training and experience 
are a cause of the sex gap in earnings, women’s earnings may affect their 
attachment to the labor market (Gronau 1988). Another problem is that human 
capital theory does not explain why years of past work experience increase 
both the chances of receiving on-the-job training and the skill level of one’s 
current job much more for men than women (Duncan & Hoffman 1978; 
Gronau 1988). 

Although women experience some loss in earnings during periods of 
withdrawal from the labor force, as a result of possible depreciation of human 
capital, their earnings potential is restored rapidly after reentry to the labor 
force (Corcoran 1979; Mincer & Ofek 1982; Corcoran et al 1983). Thus, the 
only long-term effect of withdrawals from the labor force is that attributable to 
foregone labor market experience. There is also no evidence that a pro- 
spective withdrawal from the labor force is associated with lower job require- 
ments or lower wage growth prior to the interruption (Corcoran et al 1983; 
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Gronau 1988), although some evidence suggests that women who expect to 
interrupt their labor force participation to have children invest somewhat less 
in on-the-job training (Duncan & Hoffman 1978). In short, sex differences in 
measures of job training and experience are associated with a significant 
portion of the sex gap in earnings, but depreciation of human capital during 
periods of labor force withdrawal is not an important source of the wage gap. 


Effort The amount of effort expended per hour of work, where effort refers 
to the physical and mental energy brought to a job, also may affect earnings. 
Becker (1985) recently claimed that because married women are primarily 
responsible for household work and childcare, they invest less effort per hour 
in market work than married men. Thus, even if the amount of time spent in 
the labor force by the two sexes were equal, their earnings would differ. 
Several types of evidence do not support this argument. First, being married 
and having children have been found to affect women’s earnings only in- 
directly via effects on measured qualifications other than effort, such as labor 
force experience (Suter & Miller 1973; Mincer & Polachek 1974; Treiman & 
Terrell 1975; Sandell & Shapiro 1978; Corcoran & Duncan 1979; Hill 1979). 
Second, attempts to measure sex differences in labor force attachment (in- 
cluding absenteeism and self-imposed restrictions on job choice) and their 
effect on sex differences in earnings have found these measures to be either 
unrelated or weakly related to earnings (Corcoran & Duncan 1979). Third, 
sex differences in self-reports of work effort indicate that women actually 
allocate more effort per hour to market work than do men (Bielby & Bielby 
1988). Fourth, social psychological research on internal standards of personal 
entitlement suggests that, all else equal, women are likely to allocate more 
effort to work activities than men do (Major et al 1984). 


Job segregation In using the human capital model to explain sex differences 
in earnings, economists argue that women make lower human capital in- 
vestments than men because traditional family roles make the work lives they 
anticipate shorter and more disrupted. Women are seen as less likely to make 
investments that pay off only if they sustain high-level commitment to the 
labor force and that depreciate during periods outside the labor force. Econo- 
mists also use: this reasoning to account for sex differences in fields of 
specialization and job choice (Zellner 1975; Polachek 1978, 1981). Because 
women anticipate shorter and more disrupted work lives, it is argued that they 
avoid fields where the rate of technological change is rapid and concentrate 
instead on fields where the pace of change is slower and the cost of work 
interruptions lower. Similarly, it is argued that women choose jobs that do not 
penalize discontinuity in labor force participation. These are jobs requiring 
less investment in on-the-job training, particularly firm-specific training, 
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which can be expected to have lower rates of wage appreciation and higher 
starting wages (Zellner 1975; Polachek 1981). They are also jobs where 
depreciation of earnings during time out of the labor force is minimized. This 
reasoning suggests that there should be little mobility between predominantly 
male and predominantly female jobs. It also suggests that women employed in 
predominantly female jobs should experience lower earnings returns to each 
year of labor market experience, higher starting wages, and lower rates of 
depreciation when out of the labor force than women employed in pre- 
dominantly male jobs. Moreover, women who expect discontinuous work 
lives should be concentrated in predominantly female occupations. 

There is little evidence to support these predictions. First, there is consider- 
able mobility between predominantly male and predominantly female occupa- 
tions (England 1982; Rosenfeld 1983; Corcoran et al 1984). Second, women 
in predominantly female occupations do not experience lower wage deprecia- 
tion than women in predominantly male occupations (England 1982, 1984; 
Corcoran et al 1984). Third, even with education held constant, the starting 
wages of women in predominantly female occupations are lower than the 
starting wages of women in predominantly male occupations (England 1984; 
England et al 1988). Fourth, although evidence on whether women in pre- 
dominantly female occupations experience lower wage appreciation than. do 
women in predominantly male occupations is equivocal and difficult to 
interpret, lower appreciation in predominantly female occupations is con- 
sistent with human capital theory only if lower appreciation occurs in con- 
junction with the advantage of higher starting wages (England 1982, 1984; 
Corcoran et al 1984; England et al 1988). As noted above, women in 
predominantly female occupations do not have higher starting wages. Fifth, 
women with continuous labor force participation are no less likely to be in 
predominantly female occupations than are women who have experienced 
labor force interruptions (England 1982; Daymont & Statham 1983; Corcoran 
et al 1984; Wolf & Rosenfeld 1978), and there is no association between 
women’s ages and number of children and their employment in a pre- 
dominantly female occupation (England 1982; Daymont & Statham 1983; 
Rosenfeld 1984). Finally, women’s anticipated labor force attachment in 
adolescence and early adulthood has no significant effect on the sex type of 
the occupation in which they are later employed (Lehrer & Stokes 1985). 
Thus, predictions on job choice derived from the theory of human capital are 
not supported. 


Overall assessment Overall evaluation of human capital theory as an ex- 
planation of the sex gap in earnings has focused on attempts to assess the 
degree to which sex differences in human capital investments account for the 
sex difference in earnings. The usual approach has been to regress earnings on 
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measures of human capital separately for each sex and then to estimate the 
amount of the sex gap in earnings “explained” by sex differences in mean 
levels of investment. Estimates suggest that no more than about half of the sex 
gap in earnings is associated with mean differences in human capital in- 
vestments (Mincer & Polachek 1974, 1978; Sandell & Shapiro 1978; Corco- 
ran & Duncan 1979; Mincer & Ofek 1982; US Bureau of the Census 1987). 
Proponents of human capital theory claim that more of the wage gap would be 
explained if more precise and extensive measures of the qualifications of 
individuals associated with productivity were employed, but this argument 
has become less persuasive as measurement of human capital has improved 
and at least half of the wage gap remains unexplained. Although more aspects 
of human capital investment, such as motivation to maximize earnings and the 
content and quality of education and on-the-job training might be measured, it 
is unlikely that thése additional measures would go far in explaining the 
portion of the wage gap left unexplained by measures already considered. 

Attempts have also been made to examine the relationship between histor- 
ical change in the human capital investments of the two sexes and change in 
the sex gap in earnings. It seems anomalous to some that increases in 
women’s education ‘and labor force participation have not produced a signifi- 
cant reduction in the wage gap over the last several decades. However, 
economists have argued that the stability in the wage gap is consistent with a 
lack of change in the relative skill levels of female and male workers (Smith & 
Ward 1984; O’Neill 1985). Between 1950 and 1970, the educational attain- 
ment of men grew more rapidly than that of women, primarily because men 
received college and graduate education earlier and in greater numbers than 
women. Prior to 1970, the labor force participation of less educated women 
also grew more rapidly than that of more educated women. This selection 
process made the male educational advantage greater for the work force than 
for the population as a whole. Since working women’s education declined 
relative to men’s during this period, it has been argued that an increase rather 
than a decrease in the wage gap would have been expected. More recent 
increases in women’s educational attainment have now eliminated sex differ- 
ences in the attainment of Bachelor’s and Master’s degrees and have narrowed 
sex differences in the attainment of PhDs and advanced professional degrees. 
These changes may be reflected in the recently declining sex gap in earnings 
among younger workers. 

During the post-World War II period, the average labor market experience 
of women in the work force also changed little despite the rising number of 
female entrants to the labor force. The sex gap in experience actually widened 
during the 1960s because workers entering the labor force tended to be 
housewives with little prior experience. It then narrowed somewhat in the 
1970s as average levels of experience increased, particularly among younger 
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women (Smith & Ward 1984; O’Neill 1985; Blau & Beller 1988). Since 
1980, this trend has continued, and it may be a factor in the recently declining 
sex gap in earnings, since this decline has also occurred disproportionately 
among younger women. 

Some have argued that changes in education and labor market experience 
during the 1970s should have led to a greater decline in the sex gap in earnings 
than actually occurred. The decline in the wage gap may not have been greater 
because periodic recessions produced high unemployment and because rapid 
increases in the supply of new workers and women created downward pres- 
sure on the wages of less skilled workers (O’Neill 1985). Other factors, such 
as an increase in the wage difference between union and non-union workers 
during the 1970s (Freeman & Medoff 1984), may also have played a role. 
Because investments in human capital account for at most half of the sex gap 
in earnings, there is considerable room for other types of influences to have 
offset the decline in the wage gap that would be expected on the basis of 
changes in-human capital. 

Although sex differences in worker qualifications that may affect pro- 
ductivity are associated with the sex gap in earnings, these associations are 
consistent with explanations other than that afforded by human capital theory. 
As many critics, including some economists, have pointed out, the assump- 
tion that worker characteristics such as formal education, labor market experi- 
ence, and interruptions in employment reflect differences in productivity has 
not been adequately tested. These variables are assumed to reflect productiv- 
ity, but they may reflect other influences. For example, formal education may 
be used by employers as a credential for screening workers~“and may affect 
wages because it affects access to jobs (Spence 1974). Labor market experi- 
ence may affect wages because it is a proxy for seniority, so that wages rise 
with seniority and job tenure regardless of productivity. For women, labor 
market experience may not even be exogenous to wages if allocation to 
low-wage jobs offering little opportunity for advancement affects labor force 
participation. Even the short-term decline in women’s wages after a period of 
labor force withdrawal, which has been attributed to depreciation of human 
capital, may arise from temporary mismatches between worker skills and jobs 
as a result of incomplete information on the part of women and employers 
when women reenter the labor market (Corcoran 1979; Corcoran et al 1983). 

Although information on the actual productivity of workers is difficult to 
obtain, evidence on the relationships of worker characteristics such as formal 
education and labor market experience to productivity within occupations 
suggests that although these variables have some bearing on productivity, 
their effects on earnings are largely independent of productivity (Horowitz & 
Sherman 1980; Medoff & Abraham 1980, 1981; Maranto & Rogers 1984). 
Moreover, much variability is seen across occupations and work contexts in 
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the extent to which education and work experience affect either productivity 
or earnings (Horowitz & Sherman 1980; Spilerman 1986). Recently, new 
explanations of the relationships of education and experience to earnings have 
been surfacing. Some economists now attribute the relationship between 
experience and earnings to the desire of employers, particularly those who 
make large investments in screening and training workers, to retain workers 
and motivate high performance over time (Stiglitz 1975; Lazear 1979, 1981; 
Lazear & Rosen 1981; Carmichael 1983; Shapiro & Stiglitz 1984; Bulow & 
Summers 1986). Wage increases are seen as a means of eliciting effort from 
workers when the monitoring of workers’ efforts and outputs is prohibitively 
expensive. These new explanations deviate from the view that labor is paid its 
marginal product in each short period, assuming that workers are paid their 
marginal product over the life cycle or in some cases in excess of their 
marginal product. So far, none of these explanations has an empirical basis, 
and, as single-factor explanations, they seem unlikely to account for the 
diverse compensation schemes observed across occupations and work settings 
(Talbert & Bose 1977; Spilerman 1986). 

In contrast, sociologists view the relationships of education and experience 
to earnings as influenced primarily by the organizational, and even the 
broader societal, context in which the administrative arrangements that gov- 
erm salary advancement and promotions arise. Spilerman (1986) has sug- 
gested that education and experience bear weaker relationships to earnings 
when organizational rules rigidly prescribe the temporal paths of earnings and 
occupational advancement. These rigid schedules are usually found in work- 
places where the majority of workers are engaged in a very few career lines, 
or where multiple career lines exist but there is little opportunity for transfer- 
ring among them. Such schedules often result from unionization, since labor 
unions seek to standardize work arrangements. Even the effects of education 
and experience that exist under this type of personnel system vary across cities 
in ways suggesting an influence of general societal beliefs that educated and 
experienced workers should be paid more. Because such beliefs do not 
specify how much more, compensation schedules vary widely. Education and 
experience appear to bear stronger relationships to earnings in large nonunion- 
ized organizations that encompass many occupational specialties. However, 
even in these organizations, societal notions of equity and custom may affect 
wage structures. In Japan, for example, both seniority and family size are 
major determinants of salary in latge companies (Dore 1973). 

The most serious problem with use of the human capital model to explain 
sex differences in earnings is its exclusive focus on supply-side influences. 
The model emerged because of a deficiency in neoclassical theory in the 
analysis of labor supply, but its focus on the supply-side has diverted attention 
from older neoclassical ideas on marginal productivity, which at least recog- 
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nized that wages are not a function of skills alone. Because factors other than 
worker characteristics, such as the structural conditions of employment and 
the processes by which workers and jobs are matched, may affect wages, 
analyses that do not consider these factors in conjunction with worker charac- 
teristics overestimate the importance of worker qualifications or, at the very 
least, fail to identify the process by which such qualifications affect earnings 
(Granovetter 1981). 


ALTERNATIVE SUPPLY-SIDE EXPLANATIONS Although demand-side ex- 
planations of the sex gap in earnings, which are considered in the next 
section, constitute important alternatives to human capital theory, supply-side 
explanations other than the theory of human capital have also been offered by 
sociologists and social psychologists. These alternative supply-side explana- 
tions are examined before we turn to demand-side explanations. 


Premarket socialization and allocation According to human capital theory, 
individuals make investments in their productive capacities to maximize 
lifetime income. Because the two sexes anticipate different attachments to the 
labor market, they are seen as maximizing their lifetime earnings by making 
different investments in human capital and different job choices. This applica- 
tion of human capital theory implies that the educational and occupational 
choices of women and men are motivated by a desire to maximize monetary 
reward. 

Evidence on the educational and occupational orientations of children and 
adolescents contradicts this view. Studies of children’s occupational aspira- 
tions indicate that sex differentiation in occupational goals appears at an early 
age (Marini & Brinton 1984). Children tend to look at activities, including 
work, in terms of sex-appropriate categories, and their sensitivity to the 
rewards associated with adult jobs is conditioned by the values they learn to 
consider in choosing an occupation. Boys are more aware of the status and 
monetary rewards of jobs and assimilate this information early, whereas girls 
do not become aware of these dimensions until adolescence (DeFleur 1963; 
O’Bryant et al 1980). Boys also attach more value to jobs offering high 
monetary returns than do girls (Lueptow 1980; Herzog 1982). Coupled with 
the lack of empirical support (documented above) for the theory of occupa- 
tional choice developed within the human capital model, these. findings 
suggest that sex differences in the educational and occupational orientations 
of adolescents and young adults do not arise from conscious attempts to 
maximize lifetime income, where “income” is interpreted to mean earnings. 

An alternative view is that differences in the qualifications, intentions, and 
attitudes which women and men bring to the labor market arise from sex-role 
socialization and sex discrimination prior to entry into the labor market. 
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Within a society at a particular time, individuals come to adopt sex-specific 
behavior, attitudes, and dispositional traits through processes of socialization 
and allocation that perpetuate sex-role differentiation. A number of theories of 
sex-role socialization have been advanced that differ primarily in the mech- 
anism by which sex-typed behavior is learned (Huston 1983; Marini & 
Brinton 1984). Regardless of the process by which individuals learn behavior 
appropriate for their sex, the content of what is learned depends on the 
association of sex with particular types of behavior and attitudes in the society 
in which an individual is raised. Socialization is a process of cultural 
transmission, whereby sex-specific behavior practices and attitudes are passed 
on to new generations. 

The behavior practices and datades transmitted by socialization reflect 
those factors that give rise to and sustain a sexual division of labor, including 
the segregation of women and men in different jobs within the labor market 
(Marini 1988). Because they include not only supply-side influences, such as 
women’s biological role in childbearing and nursing, but demand-side in- 
fluences, such as changes in technology that affect the demand for certain 
types of labor and the discriminatory actions of employers, a clear separation 
of supply-side and demand-side influences is impossible. Claiming that the 
effects of labor market discrimination are reflected only in sex differences in 
earnings that cannot be attributed to sex differences in qualifications or tastes, 
including “voluntary” job choices, ignores the effects of labor market dis- 
crimination on the acquisition of worker qualifications and tastes. These 
effects may arise because past labor market discrimination affects existing 
behavior patterns and attitudes, which are transmitted to new generations via 
socialization, and because employer selection for jobs and job training in- 
volves discrimination that directly affects the accumulation of formal and 
informal on-the-job training. 


Social networks Because much of the sex gap in earnings cannot be 
accounted for by measures of individual characteristics, it has been suggested 
that differences in the social networks of women and men may affect their 
access to jobs and thereby play a role in explaining the wage gap. Some 
evidence indicates that personal contacts constitute important sources of 
information and influence in the job search process, and studies suggest that 
such contacts are likely to be used more by men than women (Granovetter 
1974; Corcoran et al 1980; Campbell 1988). However, whether this sex 
difference in use of personal networks has a significant effect on the sex gap 
in earnings remains an open question. In studying a small sample of male 
technical and professional workers, Granovetter (1974) found that men who 
heard about their.jobs through a personal contact had higher incomes than 
those who used more formal means of finding a job. On the other hand, based 
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on analysis of a national sample of workers in diverse occupations, Corcoran 
et al (1980) found that the use of personal contacts by either sex produced 
only short-term wage advantages. No evidence suggested that use of informal 
information or influence in getting a job had a widespread, permanent effect 
on wages. 


Demand-Side Influences 


Demand-side explanations of the sex gap in earnings.have focused primarily 
on discrimination and aspects of market structure that perpetuate its effects. 
These explanations consider characteristics of the workplace and actors within 
it, attributing the sex gap in earnings to differences in opportunities made 
available to the two sexes. 


DISCRIMINATION Theories of discrimination within the labor market have 
been developed most fully by economists and social psychologists, although 
attempts to document discrimination have occurred in all social science 
disciplines. These theories and the evidence pertaining to them are reviewed 
briefly below. For more detailed reviews, see Blau (1984), Madden (1985), 
and Blau & Ferber (1986). 


Prejudice According to economists, labor market discrimination by sex 
exists when “two equally qualified individuals are treated differently solely on 
the basis of their sex” (Blau & Ferber 1986, p. 229). Becker (1957) provided 
the basis for the modern neoclassical analysis of labor market discrimination 
in considering race discrimination. He viewed discrimination as arising from 
a personal prejudice or “taste” for not associating with members of a particu- 
lar group. Such tastes might be held by employers, coworkers, or customers 
and could result in employers not hiring members of that group. In the case of 
sex discrimination, discrimination usually affects the hiring of women for 
jobs that do not maintain what are considered to be socially appropriate role 
relationships between the sexes. Such tastes may exist regardless of beliefs 
about whether women are equally qualified for jobs they have not traditionally 
filled. According to Becker’s reasoning, employers with tastes for discrimina- 
tion act as if there is a nonpecuniary cost to associating with women in 
socially inappropriate roles and so they hire women only at a lower wage than 
men. When discrimination is widespread and the number of women seeking 
jobs is large, some women must accept lower wages to find jobs. In a 
competitive market, all employers will then pay the same rate for labor of a 
particular sex. Bergmann (1974) has argued that an aversion to hiring women 
has resulted in the crowding of women in a limited set of predominantly 
female occupations and that this crowding reduces the wages of women in 
these occupations relative to the wages of men whose employment is not so 
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restricted. However, the neoclassical view is that market forces should erode 
discrimination in the long min since employers who hire women at lowered 
labor costs are at a competitive advantage relative to employers who hire only 
men at higher cost. 

How, then, can discrimination withstand competitive pressures? Analysis 
of racially segregated housing (Courant 1978) suggests that aversion to hiring 
women in certain occupations increases the costs of women’s job search in 
those occupations and inhibits women from entering them. The costs of 
discomfort from working in a male environment may be added to the costs of 
job search, particularly if discomfort is increased by overt interference or 
harassment by male coworkers. Market forces also inhibit agents in a market 
from capitalizing on the financial incentives not to discriminate, hypothesized 
by Becker (Courant 1978; Akerlof 1985). For example, employers who wish 
to hire women in traditionally male jobs may incur costs in overcoming the 
disincentives to women to enter these jobs. Nondiscriminatory employers 
may also incur costs through the loss of potential future business from the 
disapproval of others. Akerlof (1985) demonstrates that if Becker’s assump- 
tion about the kind of market in which agents operate is altered and a fraction 
of traders are assumed to be willing to. boycott a firm that disobeys dis- 
criminatory social customs, there is no excess profit to be made by entrepre- 
neurs who do not discriminate. 


Performance expectations Models of “statistical discrimination” develop 
the possibility that employers discriminate not out of personal prejudice but 
because they perceive that, on average, women are less productive and/or less 
stable employees than men (Phelps 1972; Arrow 1973; Aigner & Cain 1977). 
Since employers make hiring and promotion decisions under conditions of 
incomplete information and uncertainty, they tend to use readily accessible 
information to help them predict an employee’s future productivity and job 
stability. If they perceive that, on average, women are likely to be less 
productive and stable employees than men, knowledge of a potential em- 
ployee’s sex, which can be observed without cost, may influence hiring and 
promotion decisions. Average differences in the expected work lives of 
women and men are particularly likely to affect access to jobs that require 
employers to invest in firm-specific training. As noted by Aigner & Cain 
(1977), if employers’ perceptions of expected sex differences in productivity 
are accurate, acting on those perceptions will not produce differences in the 
average earnings of women and men that can be attributed to discrimination, 
since workers of each sex would receive pay commensurate with the average 
difference in productivity between the sexes. However, the judging of in- 
dividuals on the basis of beliefs about group averages can be expected to 
result in discrimination against the individual, since those who are atypical for 
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their sex will be paid more or less than their individual productivity would 
warrant. 

Statistical discrimination is likely to be pervasive because the existence of 
consensual beliefs about differences between the sexes, or sex stereotypes, is 
well documented (Broverman et al 1972; Williams & Bennett 1975). Moreov- 
er, perceptions about the suitability of women and men for different types of 
work are based largely on sex stereotypes that are inaccurate (Kiesler 1975). 
Evidence that women and men are evaluated differently comes from studies of 
selection situations and performance outputs. Males are more likely to be 
selected or ranked highly for managerial, scientific, and semiskilled positions 
than are equally qualified females. When identical professional articles and 
paintings are attributed to a male or a female, the work believed to be done by 
a man is rated more highly than the work believed to be done by a woman. 
Not all studies report these differences, but most do (see Nieva & Gutek 1981; 
Wallston & O’Leary 1981). 

Evidence that sex bias in evaluation is attributable at least in part to the 
effects of sex stereotyping on performance expectations comes from the fact 
that sex bias is evident in assessments of the suitability of both male and 
female applicants for sex-typed jobs (Nieva & Gutek 1981). In a study of sex 
segregation in California firms, Bielby & Baron (1986) found that sex differ- 
ences on measures of aptitude alone could not explain the extreme sex 
segregation they observed across jobs. There is also evidence that personal 
characteristics which enhance sex stereotyping increase the degree of bias in 
evaluation. Studies of the evaluation of qualifications and the attribution of 
causes of performance in high-level, male-typed jobs indicate that physically 
attractive women tend to be disadvantaged by their appearance, whereas 
physically attractive men are advantaged (Heilman & Saruwatari 1979; Heil- 
man & Stopeck 1985). This difference arises because physically attractive 
women are perceived to be more feminine and therefore less likely to possess 
the “masculine” traits presumed to be required for success in male-typed jobs. 

Although some economists have argued that statistical discrimination based 
on inaccurate perceptions will be eroded by market forces, there are reasons to 
believe that this will take a long time if it occurs. First, inaccurate beliefs 
become self-fulfilling prophecies if they cause employers to give women less 
training and assign them to jobs where the costs of training are minimized. 
Under such conditions women are likely to confirm employers’ stereotypes by 
exhibiting the very behavior they expect. Second, human beings are more 
likely to attend to information that is consistent with their beliefs, ignoring 
inconsistent inputs (Hamilton 1981; Berger et al 1980). When confronted with 
individuals who challenge stereotypic beliefs, they therefore tend to retain 
their stéreotypes and view individuals who do not fit as exceptions, some- 
times creating a separate category or subtype to handle the conflicting in- 
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formation. Third, as long as there are even small differences between the 
sexes, these differences provide a basis for the maintenance of stereotypes 
that exaggerate actual sex differences. There is evidence that people com- 
monly believe sex differences in ability, dispositional traits, and -behavior to 
be far greater than they are (Broverman et al 1972). Sex also functions as a 
general status characteristic, providing a basis for the development of per- 
formance expectations even on tasks where there is no evidence to suggest 
that one sex should perform better than the other (Berger et al 1980). 
Statistical discrimination based on inaccurate perceptions will therefore not be 
easily eroded. Even in the unlikely event that sex stereotypes accurately (and 
precisely) reflect aggregate differences in the behavior of women and men, so 
that taking them into consideration might be seen as consistent with profit 
maximization, judging an individual’s suitability for a particular job on the 
basis of group characteristics rather than an individual’s own merits is unfair 
to the individual and has been outlawed by antidiscrimination laws and 
regulations. 


Evidence There has been little attempt to evaluate empirically and to com- 
pare alternative theories of discrimination. Most research on discrimination 
has sought to document the existence and extent of discrimination. Attempts 
to measure sex discrimination in earnings usually employ a “residual” 
approach, whereby sex differences in earnings that cannot be accounted for by 
sex differences in productivity-related characteristics are attributed to dis- 
crimination. In these analyses, earnings regressions containing measures of 
individual characteristics believed to be related to productivity are estimated 
by sex, and earnings differences due to sex differences in returns to the given 
set of characteristics are attributed to discrimination. These analyses suggest 
that discrimination not only exists but is extensive, since at least half of the 
sex gap in earnings is not associated with measures of the qualifications of 
individuals believed to affect productivity. However, this approach has 
numerous problems, which make meaningful conclusions about discrimina- 
tion impossible to draw. 

One problem, usually pointed out by economists who believe that market 
forces will erode discrimination, is that analyses employing this approach 
have not measured all of the individual characteristics that may affect 
productivity; consequently, the extent of discrimination may be overesti- 
mated. A different problem is that even the variables considered are only be- 
lieved to affect productivity; but they could affect earnings for a variety 
of other reasons. Yet a third problem is that many of the productivity-re- 
lated characteristics included in the equations estimated may be affected 
by discrimination. Treating sex differences in these variables as investments 
unaffected by labor market discrimination underestimates the effect of dis- 
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crimination. These “investments” may be affected by the direct experience of 
discrimination, or they may result from an effect of past discrimination on the 
returns expected from such investments by the two sexes. 

A fourth problem with the approach typically used to measure discrimina- 
tion is that it either ignores or deals ineffectively with the effect of sex 
segregation on wages. As noted above, the two sexes tend to be segregated in 
different jobs both within and between firms, and there are different wage 
rates attached to the jobs they hold. In early attempts to document the 
existence and extent of sex discrimination in earnings, the effect of job 
segregation by sex was ignored. Effects of sex differences in occupational 
choice that were not captured by sex differences in measures of human capital 
investments were labeled as arising from discrimination. No attempt was 
made to distinguish between supply-side choices resulting from socialization 
and discrimination prior to entry into the labor market (or, alternatively, from 
attempts to maximize lifetime earnings) and discriminatory treatment on the 
demand side. Recently, some analysts have included measures of job and firm 
characteristics in the equations estimated, usually as measured for occupation- 
al categories, industrial categories, and/or segments of the economy. Vari- 
ance explained by these variables is sometimes considered to reflect dis- 
crimination and sometimes not. Even with the inclusion of these variables, it 
remains impossible to determine the extent to which variation in wages 
associated with an individual’s location in the economy arises from individual 
self-selection, discriminatory allocation, or other processes that affect person- 
job matches. Making these distinctions will require different types of analy- 
ses, especially when it is recognized that past discrimination may condition 
the operation of all these processes in the present. 

More direct evidence of sex discrimination in earnings comes from analy- 
ses of data for individual firms, often carried out in connection with dis- 
crimination suits. Evidence pointing to the existence of discrimination has led 
to the settling of many suits in favor of the plaintiffs. The principal problem 
with this type of evidence is that it does not permit an economy-wide 
assessment of the extent and nature of discrimination, particularly since the 
process by which firms are selected for study makes it unlikely that they are 
representative of all firms. 


THE WAGE STRUCTURE Both economists and sociologists recognize that 
wages vary in ways that cannot be attributed to the characteristics of in- 
dividual job holders. As noted above, economists have long maintained that 
the marginal product of labor is affected by characteristics of the job and 
employer, demand for the product, and the supply schedule of labor.. More 
recently, both institutional economists and sociologists have argued that these 
market influences may give rise to a wage structure characterized by relatively 
permanent economic segmentation. 


SEX DIFFERENCES IN EARNINGS 363 


Industrial segmentation Employers vary in the kinds of jobs they offer and 
the wages they pay. Attempts have been made to capture this variation by 
assigning firms to economic sectors as well as to more disaggregated industry 
groupings. Most attempts to define economic sectors have identified two 
sectors: oligopolistic and competitive, or, alternatively, core and periphery. 
(See Wallace & Kalleberg 1981 for a review.) Firms in the oligopolistic (or 
core) sector are argued to be distinguished from those in the competitive 
(periphery) sector by their greater power in factor and product markets. 
Sometimes the government is treated as a third sector. 

Some evidence suggests that those employed in the oligopolistic sector earn 
more than those employed in the competitive sector, although this distinction 
appears to make more of a difference to men than women (Beck et al 1978; 
Kalleberg et al 1981; Ward & Mueller 1985). Employment in the government 
sector rather than in the private nonunionized sector also has a positive effect 
on the wages of women but little effect or a negative effect on the wages of 
men (Oaxaca 1973; Sandell & Shapiro 1980; Gronau 1988). Longitudinal and 
time-series analyses further indicate that movement between sectors has 
implications for the earnings of both sexes (Rosenfeld 1983; Taylor et al 
1986), and there is evidence that returns to human capital are generally higher 
in the oligopolistic sector than in the competitive sector (Beck et al 1980; 
Rumberger & Carnoy 1980). Sectoral location has been considered relevent to 
the explanation of sex differences in earnings primarily because women are 
disproportionately located in the competitive sector (Beck et al 1978; Rosen- 
feld 1983; Ward & Mueller 1985), although women earn less than men within 
each sector (Hodson 1978; Rosenfeld 1983; Coverman 1986). 

The dual economy notion has received considerable criticism for failing to 
capture the multiple and nonoverlapping dimensions on which firms and jobs 
vary (Baron & Bielby 1980; Bridges 1980; Hodson & Kaufman 1982; Cover- 
man 1986). Analysis of more disaggregated industry groupings has revealed 
that a number of variables on which industries can be characterized, such as 
market power, capital intensity, unionization, and firm size, have positive 
effects on wages. Market power, capital intensity, and unionization have also 
been found to be negatively associated with the position of women relative to 
men, as measured by employment of women in the industry (Luksetich 1979; 
Bridges 1980; Parcel & Mueller 1983; Ashenfelter & Hannan 1986; Hodson 
& England 1986) or the degree of occupational sex segregation (Bridges 
1982). 

As noted by England & Farkas (1986), variation in wages at the level of the 
industry or firm may be explained in part by the Hicks-Marshall laws that 
determine the elasticity of demand. Because a trade-off exists between wage 
rates and employment, whatever determines the elasticity of demand for labor 
affects the bargaining power of workers. One of the Hicks-Marshall laws 
establishes a relationship between the price elasticity of labor demand and the 
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price elasticity of demand for the product, indicating that the former is greater 
when the latter is high. Since the demand curve for the product is less elastic 
when a firm is a monopolist in the product market, and oligopolists tend to 
behave like monopolists, the relatively high wages in oligopolistic industries 
may be explained by less elastic labor demand curves. The other Hicks- 
Marshall law states that labor demand curves are more elastic when labor is a 
high proportion: of all production costs. Since labor is a lower proportion of 
costs in capital-intensive industries, the less elastic labor demand curves in 
such industries may explain their relatively high wages. Unionization also 
lowers the elasticity of labor demand, a relationship that helps to explain the 
higher wages under unionization. 

Firms that are able may pay higher than competitive wages to enhance the 
firm’s reputation, maintain high morale, establish a safety margin for future 
hard times, and simplify recruitment by maintaining a queue from which to 
hire workers (Reynolds 1951; Stiglitz 1973). With a chronic surplus of 
applicants, such high-wage employers may indulge a preference for em- 
ployees with characteristics that are unrelated to productivity and may do so at 
-no cost in productivity if the applicant pool is large enough to yield many 
applicants with similar productivity-related qualifications. Thus, high-wage 
employers may be more likely to discriminate, and industry characteristics 
that foster high wages are likely to be associated with a relatively dis- 
advantaged position of women in the industry. An alternative explanation for 
an industry or firm’s paying relatively high wages is the desire of employers 
to retain workers and motivate high performance (Shapiro & Stiglitz 1984; 
Bulow & Summers 1986). Since offering lower wages to women than men for 
the same work would create an incentive for women to quit or shirk, employ- 
ers have no incentive to hire women at lower wages in the same jobs as men. 
When they do hire women, they are therefore likely to hire them in different 
jobs at lower wages. 

Because the industrial characteristics identified above affect the kinds of 
jobs available within an industry, some of the relationships observed between 
industrial characteristics and the relative position of women may be due to 
variation in the distribution of occupations across industries. When Bridges 
(1980, 1982) standardized for variation in the occupational distribution, he 
found that market power was associated with greater occupational segregation 
but not with a lower representation of women. Women were underrepresented 
in capital-intensive industries even after standardization, but the distribution 
of women across industries was not associated with a wide range of indicators 
of industrial marginality. Women were found to be underrepresented in 
durable goods manufacturing primarily because they were underrepresented 
in the kinds of occupations found disproportionately in manufacturing. They 
were also underrepresented in mining and construction. 
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Although the level of unionization of an industry bears a positive relation- 
ship to wages (Freeman & Medoff 1984), women have benefited little from 
this relationship because they are much less likely to be employed in occupa- 
tions and industries that are unionized (Hodson & England 1986; Freeman & 
Leonard 1987). Unions have been found to raise the wages of women as much 
or more than the wages of men (Kalleberg et al 1981; Freeman & Leonard 
1987), but because unions are differentially found throughout sectors of the 
economy, unionization has had little effect on the sex gap in earnings for the 
economy as a whole (Freeman & Leonard 1987). 


Occupational segmentation Economists and sociologists have also argued 
that labor markets are segmented by occupation and job title. During the 
1940s and 1950s, institutional economists such as Kerr (1954) and Dunlop 
(1957) drew a distinction between internal and external labor markets. This 
distinction was later reintroduced by Doeringer & Piore (1971). The central 
idea is that administrative rules and procedures tend to set up separate markets 
for those already hired (an internal labor market) and those seeking employ- 
ment (an external labor market). A firm hires workers from the outside labor 
market into “entry jobs,” and other jobs are filled internally as workers 
progress on well-defined career ladders by acquiring job-related skills, many 
of which are firm-specific. Since the firm determines wages for job categories 
and allocates workers to them, wages are less directly affected by the impact 
of market forces. 

The concept of the internal labor market was developed further in what was 
first called dual and then segmented labor market theory (see Gordon 1972; 
Kalleberg & Sørensen 1979). This theory draws a distinction between primary 
and secondary jobs, arguing that the internal labor market is only one kind of 
work setting. Primary jobs emphasize long-term attachment between workers 
and firms and offer built-in career ladders and promotion opportunities, 
whereas secondary jobs do not offer these advantages. The distinction be- 
tween primary and secondary jobs may occur within the same firm or between 
firms, since primary jobs are considered more likely to be found in 
monopolistic, unionized industries, and secondary jobs in competitive in- 
dustries. 

A number of explanations for the emergence of these distinct labor market 
sectors have been given. Becker (1964) noted that when workers receive 
firm-specific training, it is in the interest of employers to bind them to the firm 
with promises of promotion, salary increases, and employment security. 
Bulow &.Summers (1986) have argued that job hierarchies offering promo- 
tion prospects and wage increases are an important means of motivating 
workers when individual performance is not easily monitored. Others, such as 
Spilerman (1986), have noted that workers have an interest in barring lateral 
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entry and in having high-level positions filled through promotion. Workers 
may also wish to limit employer discretion by having decisions about promo- 
tion and layoff tied to seniority. These worker interests cause labor unions to 
work to create more widespread job hierarchies and promotion. Because firms 
characterized by high profit levels, oligopolistic pricing, and large organiza- 
tional size can better afford to create internal labor markets, these firms have 
been argued to be more likely to do so (England & Farkas 1986). 

In keeping with hypothesized differences in the wage-setting processes of 
different sectors (but also with alternative explanations), measures of human 
capital such as education and work experience have stronger effects on the 
earnings of male workers in the primary than in the secondary sector (Oster- 
man 1975; Rumberger & Carnoy 1980). Some differences in returns to human 
capital across sectors have also been documented for women, although these 
are smaller and show some inconsistency (Coverman 1986). Empirical sup- 
port for the argument that women and other minorities are concentrated in the 
secondary sector is weak (Hodson & Kaufman 1982; Coverman 1986); and 
contrary to the prediction that there is little mobility between sectors, con- 
siderable intersectoral mobility has been documented (Wachter 1974; Kal- 
leberg & Sørensen 1979). Women are somewhat more likely than men to be 
in the secondary sector, much more likely to be in the subordinate primary 
sector (which includes many clerical jobs as well as relatively unskilled 
unionized jobs), and much less likely to be in the independent primary sector 
(which includes many professional, technical, managerial, and craft jobs) 
(Gordon 1982, cited in Hartmann 1987). However, within each sector women 
earn less than men, and knowledge of women’s sectoral placement leaves 
much of the sex gap in earnings unexplained (Rumberger & Carnoy 1980; 
Parcel & Mueller 1983; Coverman 1986). 

Dual and other segmented labor market theories have come under attack for 
the same reasons that theories of economic segmentation have been criticized 
(Wachter 1974; Baron & Bielby 1980; Hodson & Kaufman 1982). Es- 
sentially, they are too crude to meaningfully characterize the multiple di- 
mensions on which labor markets vary. This criticism is relevant to their 
usefulness in explaining the sex gap in earnings, since many jobs held 
predominantly by women are not integrated into the same kinds of job ladders 
as jobs held predominantly by men (Baron et al 1986). In general, women 
have more limited lines of progression, which afford less access to on-the-job 
training and managerial positions (Malkiel & Malkiel 1973; Duncan & Hoff- 
man 1978; Halaby 1979a,b; Baron et al 1986; Hartmann 1987). Job ladders 
are highly segregated by sex, and sex affects job placement even in sex- 
integrated ladders (Baron et al 1986). Since the relationship of sex to the 
clustering of jobs in promotion ladders and to the positioning of jobs in those 
ladders is not explained by the technical characteristics of jobs or the charac- 
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teristics of firms, sex appears to constitute an independent basis for job-ladder 
formation. 

Recent analysis of the degree of sex segregation in the labor force indicates 
that more than half of. the workers of one sex would have to change census 
detailed occupational categories to make the occupational distributions of the 
two sexes equal (Beller 1984; Blau 1988; Jacobs 1989). Within occupations 
male and female workers are also segregated both within and between firms. 
Using employers’ own job classifications in an analysis of job segregation in 
California firms in the 1960s and 1970s, Bielby & Baron (1984, 1986) 
estimated that 96% of workers of one sex would have to change job titles to 
equalize the distributions of the sexes across jobs. 

Sex differences in earnings are strongly tied to the segregation of women 
and men in different jobs. Although estimates of this association are affected 
by the degree of disaggregation of the job classification used (Sanborn 1964; 
Cohen 1971; Oaxaca 1973; Treiman & Hartmann 1981; Roos 1981), occupa- 
tional gas pea accounts for 35-40% of the sex gap in earnings when the 
census detailed (three-digit) occupational classification is used for the labor 
force as a whole (Treiman & Hartmann 1981). Occupational segregation 
accounts for almost twice as much of the earnings gap among those without a 
college degree (30%) as among college graduates (17%) (US Bureau of the 
Census 1987). If a more refined job classification is used, even more of the 
sex gap in earnings is associated with job segregation (Malkiel & Malkiel 
1973; Bridges & Berk 1974; Talbert & Bose 1977; Blau 1977; Halaby 
1979a,b; Miller et al 1980; Bielby & Baron 1984). 

Although the wage gap is strongly tied to job segregation, measured 
characteristics of jobs held by the two sexes do not account for all of the 
relationship between sex differences in earnings and job segregation. With 
measures of individual qualifications and job characteristics held constant, 
sex has an independent effect on earnings (Bridges & Berk 1974). Analysis of 
the relationship between occupational sex composition and earnings using 
aggregate data for the census detailed occupational classification also in- 
dicates that measures of the skill requirements and working conditions of an 
occupation (usually taken from the Dictionary of Occupational Titles) and 
average personal characteristics do not account for all of the relationship 
between occupational sex composition and earnings (McLaughlin 1978; Trei- 
man & Hartmann 1981; Kemp & Beck 1986, Parcel 1989). Similarly, job 
characteristics and the average personal characteristics of job holders do not 
account for all of the relationship between the sex composition of a job and its 
earnings level within a firm (Rosenbaum 1985). Although it is unclear 
whether (or how much more of) the relationship between the sex composition 
of an occupation or job and its earnings level would be explained if additional 
measures of occupational and average personal characteristics were consid- 
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ered, the analyses done to date call into question the claim that wage differ- 
ences between female-typed and male-typed jobs persist because of 
“compensating differentials,” that is, because female-typed jobs have other 
advantages over male-typed jobs (Killingsworth 1985; Filer 1985). Changes 
in the sex composition and earnings level of an occupation or a job are also 
related; an increase in the proportion of women is associated with a decrease 
in relative earnings (Ferber & Lowry 1976; Pfeffer & Davis-Blake 1987; 
Baron & Newman 1989). 

At the individual level of analysis, employment in a predominantly female 
occupation bears a negative relationship to the earnings of both sexes, 
although much of the relationship is attributable to individual and labor 
market characteristics (including other occupational characteristics) that affect 
earnings (England 1982; Daymont & Stratham 1983; Parcel & Mueller 1983; 
Taylor et al 1986; England et al 1988). Again, it is unclear whether this 
relationship would persist if additional measures of individual and labor 
market characteristics were available. Individual-level analyses have also 
documented considerable movement between predominantly male and pre- 
dominantly female occupations (Wolf & Rosenfeld 1978; Rosenfeld 1983; 
Corcoran et al 1984), and this movement seems to have little direct effect on 
earnings (Rosenfeld 1983). i 


Geographical segmentation The boundaries of local labor markets in con- 
junction with the forms of segmentation described above have implications 
for the sex gap in earnings. Robinson (1933) and, more recently, Madden 
(1973) have identified monopsony power by employers who are large buyers 
of labor relative to the size of a particular market as a source of imperfect 
competition that may produce and perpetuate discrimination. Monopsony 
power exists when there is one employer or a collective group of employers in 
a particular local labor market for particular worker skills. Monopsony power 
results in discrimination against women when female labor supply to the firm 
is less wage elastic, that is, when a wage change produces a smaller change in 
the number of female workers than in the number of male workers seeking 
employment. A monopsonist will pay less for female workers if their labor is 
less elastically supplied to the firm than is male labor. This situation may arise 
because women have fewer employment alternatives. 

Monopsony models are usually argued by both sociologists and economists 
to involve the collusion of individuals within a group. The monopsony model 
developed by Madden (1973) and the theories of patriarchy developed by 
Hartmann (1976) and Strober (1984) see collusion among men as keeping 
women out of high-paying jobs. It has been argued by some that such 
monopolies are inherently unstable because individual employers have an 
incentive to cheat and hire cheaper (female) labor, but such cheating will pay 
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off only if it does not draw sanctions or others do not follow suit. Collusion 
may also be supported by nonpecuniary group loyalties (England & McCreary 
1987). 

There has been little research on the extent of monopsony in the labor 
market, although there is some evidence of a positively sloped labor supply 
for specific industry and local area labor markets and for firms (Madden 
1985). There is also some evidence that monopsony power lowers the wages 
of women, especially in local labor markets where there are high degrees 
of union activity and occupational segregation (Cardwell & Rosenzweig 
1980). 


Jobs and firms During the 1940s and 1950s, when institutional economists 
began to stress the concept of “wage-structure,” they adopted the view that 
wages are attached to jobs and that the problem of wage determination 
requires an understanding of well-defined systems of jobs and firms. For 
example, Dunlop (1957) argued that within a firm there are groups of jobs, or 
“job clusters,” each of which is linked together by technology, the administra- 
tive organization of the production process, and the social customs of the 
work community. A job cluster usually contains one or more key jobs and a 
group of associated jobs, and the wage rates for the key jobs mediate the 
effects of labor market influences, including union and government wage 
policies, and forces in the market for products on the wage structure of the 
firm. Dunlop also argued that there are stable groups of wage determining 
units, such as bargaining units, plants, or firms, which are linked together by 
similarity of product markets, use of similar sources of labor supply, or 
common labor market organization. He called these “wage contours” and saw 
them as having three dimensions: particular occupations or job clusters, a 
sector of industry, and a geographical location. Differences in wage rates by 
wage contour were argued to reflect the basic nature of product and labor 
markets. 

The call of institutional economists for an understanding of well-defined 
systems of jobs and firms has recently been echoed in sociological critiques of 
attempts to define and measure labor market segmentation by cruder typolo- 
gies, including industrial and occupational categories (Baron & Bielby 1980; 
Baron et al 1986). Because there is extensive heterogeneity within these 
broader groupings, and there is no way to link them to the kinds of job clusters 
and wage contours described above, more disaggregated analyses of jobs and 
firms are needed. This need is underscored by results of analyses already 
undertaken at the job and firm level, which indicate substantial sex segrega- 
tion within census detailed occupational and industrial categories by both job 
title and firm. Analyses of sex-neutral and female-dominated occupations 
indicate that males are disproportionately represented in positions of prestige 


370 MARINI 


and authority within those occupations (Grimm & Stern 1974; Mennerick 
1975). In firms women also occupy lower-status positions within occupa- 
tions, usually segregated in different job titles, sometimes in different de- 
partments. Bielby & Baron (1984) found that over half of the California 
establishments they studied were completely segregated by job title and that in 
four fifths of them 90% or more of one sex would have to change jobs for the 
job distributions of the two sexes to be equal. This extreme sex segregation 
suggests that much of the sex gap in earnings is attributable to job allocation 
by employers rather than to occupational choices by workers prior to entering 
the firm. 

Within occupations there is also sex segregation by firm, with women 
disproportionately employed in lower-paying firms (Bridges & Berk 1974; 
Blau 1977; Talbert & Bose 1977; Ferber & Spaeth 1984). Blau (1977) found 
evidence of a wage hierarchy of firms within a local labor market that was 
consistent across occupations. The representation of women in the firm was 
also consistent across occupations and bore a negative relationship to the 
relative wage standing of the firm. This sex segregation within occupations by 
firm again suggests that much of the sex gap in earnings associated with job 
segregation is not attributable to sex differences in occupational choice but to 
discrimination by employers. 


The influence of gender The relative rates of wage contours reflect the 
nature of product and labor markets and are influenced by union and govern- 
ment policy. Because new and expanding industries or contours using the 
same type of labor have had to pay higher wages to attract labor, and because 
the differential rates of wage contours persist long after the market conditions 
creating them have changed, the wage structure reflects the pattern and speed 
of industrialization (Dunlop 1957). Industries developed during the early 
period of industrialization tend to have lower wage rates than those developed 
later. Similarly, new and expanding occupations or job clusters using differ- 
ent types of labor are affected by the relative supply of labor at the time of 
their emergence. When the market for a particular type of labor is tight, a 
higher wage rate develops than when the market is loose. 

Differential rates among wage contours persist because they become in- 
stitutionalized in the administrative rules and procedures of organizations, and 
workers come to regard them as proper. The relative structure of rates takes 
on normative elements, shaping the preferences and relative ratings workers 
give to jobs. That structural conditions can affect notions of equity and 
custom is evident in the apparent influence of unions on the wage-setting 
process in capitalist countries. In the United States, relatively weak and 
fragmented unions are believed to have led to more decentralized bargaining 
and greater earnings inequality than in some other capitalist countries (Gold- 
thorpe 1984). Employers in the United States therefore believe that skilled 
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workers ought to be paid much more than unskilled workers, even though the 
wage differentials that exist appear to be considerably greater than would be 
required to motivate the upward movement of labor (Reynolds 1951). 

Once structural conditions have shaped notions of equity and custom, the 
expectations of workers and employers influence subsequent wage-setting. 
Within the internal labor market, wages are determined primarily by man- 
agerial discretion and union-management negotiation. Since the best alloca- 
tions are those that maintain high morale and productivity, they tend to reflect 
workers’ concepts of equity. If it is believed that better educated and more 
experienced workers are entitled to more pay or that whites and men are 
entitled to more pay, wage differentials tend to reflect these beliefs (Madden 
1985). 

Sex-role differentiation and stratification have existed in all known human 
societies (Marini 1988). With the coming of industrialization, women became 
“secondary workers” in the labor market. Initially shut out of many jobs and 
constrained by their biological role in childbearing and nursing, women 
entered the labor market in smaller numbers and for shorter periods than men. 
Neither employers nor workers (including women themselves) initially saw 
women as entitled to the same wages as men. Thus, as the demand for labor in 
certain white-collar jobs increased after World War II, women constituted an 
available supply of labor with few good employment alternatives. Although 
prevailing notions of equity regarding the paying of equal wages for equal 
work made it difficult (and therefore probably unprofitable) to pay women 
less than men in the same job, sex-role differentiation made it possible to 
employ women and men in different jobs at different wage rates. Since there 
was little presumption that women and men were similar, employing them in 
different jobs, which presumably required different skills, was accepted and 
became common practice. As newcomers to the labor market with few 
employment alternatives, women accepted low pay. Employers offered low 
pay because they could attract labor at those rates. Comparisons that might 
have led to perceptions of inequity were inhibited by the fact that women and 
men were employed in different jobs. Thus, a sex-segregated labor force 
developed with substantial differences in pay between predominantly female 
jobs and predominantly male jobs. Just as other differentials between wage 
contours have persisted long after the market conditions creating them have 
changed, differentials between wage contours defined by gender have per- 
sisted. 

Like other differential rates among wage contours, those associated with ` 
sex segregation persist because of institutional and attitudinal barriers to their 
erosion. The formalization of personnel procedures within organizations tends 
to institutionalize discriminatory practices in the recruitment, allocation, and 
retention of workers in particular jobs (Roos & Reskin 1984). Since organiza- 
tional structures remain stable in the absence of concerted efforts to change 
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them, discrimination persists via the routine implementation of personnel 
practices. Evidence that older agencies in the California civil service system 
are more sex segregated than otherwise comparable, younger ones suggests 
that discriminatory practices existing at the time an agency was founded 
became institutionalized in the agency’s personnel procedures and thus per- 
sisted (Bielby & Baron 1987). 

Even when a need for change is recognized, internal labor markets slow the 
process of change by perpetuating the effects of discrimination in entry-level 
jobs throughout the careers of individuals and by reducing the amount of 
interfirm mobility. Since internal labor markets are more often found in 
high-paying firms, they are particularly likely to limit mobility into the most 
desirable organizations. They may even limit mobility across departments or 
jobs clusters within the same organization if narrowly defined seniority 
systems impose restrictions or penalties for transferring across units. The 
inertia brought about by institutionalized procedures is reinforced by per- 
ceived costs of change. Since employers incur costs in the recruitment and 
training of workers, they have an incentive to retain their existing workforce 
even if they could hire female workers from outside the firm at lower wages. 
They may even be slow to replace departing males with females if they 
perceive it would be disruptive to the male workforce. 

Job requirements instituted without discriminatory intent, such as upper age 
limits for entering apprenticeships (which are more difficult for women to 
meet because of childbearing) and veterans’ preferences (which almost al- 
ways accrue to men) can be discriminatory in their effects. Similarly, the 
physical design of machinery and lavatories can affect the suitability and 
attractiveness of traditionally male jobs for women. The interpersonal climate 
women encounter can also be important. When most or all of a woman’s 
coworkers are men who are unsupportive, they can create an unpleasant 
working environment and interfere with a woman’s job performance. Even if 
male coworkers do not overtly interfere, working primarily with men can be a 
disincentive to women’s employment in traditionally male jobs. 

Sex segregation and associated wage differentials are rooted not only in 
organizational structures but in a belief system that justifies and reinforces 
their existence. Employers come to see segregated structures as rational and 
efficient, and the preferences and expectations of workers are affected by 
perceptions of the sex-appropriateness of jobs. It has sometimes been argued 
that if women prefer female-typed jobs at lower wages to male-typed jobs at 
higher wages, female-typed jobs must have other advantages over male-typed 
jobs. As noted above, consideration of the skill requirements, working con- 
ditions, and other characteristics of occupations has not accounted for the 
relationship between the sex composition of an occupation and its earnings 
level. To the extent that consideration of additional measures of occupational 
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characteristics does not provide evidence that “compensating differentials” 
exist, it is reasonable to infer that the primary “compensating differential” is 
the sex type of the occupation itself. Both sexes demonstrate a preference for 
occupations with a high proportion of incumbents of their own sex. However, 
because female-typed occupations tend to pay lower wages than male-typed 
occupations, women must pay a price that men do not for the advantages that 
accrue from working in an occupation that is perceived to be sex-appropriate 
and that fosters interaction with same-sex peers. Women pay this price only 
because the opportunity structure does not afford them the option of choosing 
female-typed occupations that pay high wages. Thus, the association of sex 
segregation with wage differentials offers women less desirable employment 
opportunities than men, making the wage structure itself discriminatory. 


DIRECTIONS FOR FUTURE RESEARCH 


Greater understanding of the mechanisms producing sex differences in earn- 
ings in the United States will require new research on a number of fronts. 
Because sex has been a-basis for the allocation of workers to jobs and sex 
segregation bears a strong relationship to wage differentials, research on the 
origin and persistence of sex-based differentials in the wage structure is 
needed. This research should proceed by attempts to identify the emergence 
of wage contours defined jointly by occupation or job cluster, a sector of 
industry, geographical location, and sex. Understanding the conditions -that 
give rise to the emergence of sex-based wage contours can provide informa- 
tion useful for assessing the nature of sex discrimination. For example, it is 
unknown to.what extent wage rates in female-typed jobs are influenced by the 
crowding of women into a relatively small number of jobs, as suggested by 
Bergmann (1974). Are women paid less because of occupational crowding or 
because they are considered to be entitled only to lower wages? 

Of even greater contemporary interest than the conditions giving rise to the 
emergence of sex-based wage contours is the process of change in the wage 
structure. Although conflicting arguments have been advanced about the 
erosion of discrimination by market forces, there has been little empirical 
research on the actual process of change. Little is therefore known about how 
much change occurs, under what circumstances, and how quickly. Is change 
slow because organizational structures and belief systems change slowly, or 
are there barriers to the free operation of the market that permit some wage 
differentials to persist indefinitely? 

Change in the wage gap can result from changes in job segregation by sex 
or changes in the relative rates of wage contours. It is of interest to identify the 
impact of these two types of changes as well as their interrelationship. Has 
there been a decline in the association between sex segregation and wage 
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differentials? If so, has it occurred primarily because of a shift in the sex 
composition of jobs or because of change in the relative wage rates of jobs? 
Has it occurred through the emergence of new occupations, industries, orga- 
nizations, and jobs or through changes in existing structures? To understand 
the susceptibility of sex-based wage differentials to erosion by market forces, 
it is important to determine whether the relative rates of sex-based wage 
contours persist despite changes in the supply of particular types of labor. It 
is also important to understand the relationship between changes in a job’s 
sex composition and earnings level. There is evidence that these two vari- 
ables change together, but it is unknown what produces their relationship. 
Finally, it is important to identify the process by which women are actually 
integrated into the workplace, determining why women are hired, whether 
they are hired in the same jobs as men, whether they are hired at the same 
or lower wage rates than men, and whether their hiring gives firms a compe- 
titive advantage. 

Given the existence of a sex-segregated labor market and associated wage 
structure, research on the process by which individuals are sorted into differ- 
ent positions within the wage structure is also needed. This type of research 
requires longitudinal analysis of birth cohorts from at least the teenage 
through the adult years. By following cohorts over the life course, it would be 
possible to determine more accurately the relative influence of choices made 
.by workers, discrimination in the labor market, and other social processes that 
affect worker-job matches. Initiating data collection in adolescence permits 
the measurement of productivity-related characteristics and occupational 
goals prior to labor market entry. It also makes it possible to study sex 
differences in the extent to which occupational goals are realized and to 
identify factors responsible for sex differences in the discrepancy between 
aspirations and attainment. To the extent that sex differences in earnings are 
not attributable to productivity-related characteristics or occupational goals at 
entry into the labor market, the process by which they emerge after labor 
market entry can be examined. It is of particular interest to identify the 
influence of job placement early in an individual’s career on the cumulation of 
on-the-job training and subsequent career advancement. Intercohort com- 
parisons would permit analysis of changes in these life-course processes. 

In addition to research on the wage structure and the process by which 
individuals are sorted to positions within it, detailed research on particular 
aspects of the process is needed. Application of the theory of human capital to 
the explanation of sex differences in earnings rests on the assumption that, 
within constraints imposed by the sexual division of labor, individuals are 
motivated to maximize earnings. Research on the determinants of educational 
and occupational goals are needed to test this assumption. How important are 
expected life-time earnings compared to other aspects of an anticipated career 
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in the decision-making of individuals? How do the two sexes differ in their 
knowledge and perception of various jobs? How do the channels of informa- 
, tion through which they learn about jobs differ? Do the sexes value various 
attributes of jobs differently? Some research on these questions has been 
carried out, but it has not been linked to thorough analysis of the determinants 
of sex differences in educational and occupational expectations. 

In addition to studying sex differences in the process of educational and 
occupational decision-making, research on sex differences in the process of 
job search is needed. Given the same occupational goals, do the sexes use 
different methods of job search? Do they have access to the same information 
about available jobs? Does one sex elicit and/or receive more help than the 
other? Do they respond differently to indefinite or discouraging messages? 
Are members of one sex more assertive in putting themselves forward than 
those of the other? Does one sex invest more in search to get the same return? 

Finally, research-is needed on the process by which employers make hiring 
and promotion decisions. Although several theories of discrimination have 
been advanced, there has been little attempt to study the motivation (prej- 
udice, performance expectations) or source (employers, coworkers, custom- 
ers) of discrimination against women. It is quite likely that prejudice and 
performance expectations are related and that discrimination has more than 
one source. Direct surveys of actors in the workplace (employers, coworkers, 
customers) and experiments eliciting information about the process of em- 
ployer evaluation can inform us about the attitudes and beliefs that produce 
discriminatory outcomes. Research is also needed on the extent to which 
discrimination results from employer discretion or the routine implementation 
of institutionalized employment practices. 
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Abstract 


This paper defines the concepts of gender and social reproduction as de- 
veloped in feminist theory and discusses their utility for synthesizing recent 
historical research on women. We review literature on the emergence, in- 
stitutionalization, and reorganization of “separate spheres” in nineteenth and 
early twentieth century Europe and North America. Focusing on social class 
differences in family strategies, procreation, sexuality, consumerism, pro- 
fessionalization, and state policy, we argue that the organization of gender 
rélations and social reproduction crucially shaped macrohistorical processes, 
as well as being shaped by them. 


INTRODUCTION 


The two concepts central to this review—gender and social reproduction— 
have been developed out of efforts to understand relations among women and- 
between women and men under different historical, social, and cultural 
conditions. This essay defines the new meanings of the terms gender and 
social reproduction as they have developed within feminist theory, reviews 
the recent historical literature that has informed the development of these 
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concepts, and indicates their contribution to current theoretical debates.! 
(Scott 1986, 1988, Lamphere 1987a, and Moore 1988 review similar debates 
in history and anthropology, respectively.) 


Gender 


Feminist scholars argue that, like class, gender constitutes one of the basic 
dimensions of all social organization. Gender refers to socially constructed 
and historically variable relationships, cultural méanings, and identities 
through which biological sex differences become socially significant. Gender 
is seen, not as structurally determined, but as the outcome of women’s and 
men’s actions under historically specific conditions. More than an individual 
characteristic, gender refers to the social relationships between women and 
men that shape personal identity. In addition to being constructed and recon- 
structed interactionally, gender is institutionalized in, for instance, families 
(Brenner & Laslett 1986), schools (Hansot & Tyack 1988), politics (Nelson 
1989), culture and ideology (Douglas 1977; Scott 1987), the labor market and 
the workplace (see, for instance, Baron 1990; Cockburn 1983; Groneman & 
Norton 1987; John 1986; Milkman 1985; Rose 1986, 1987, 1988). The 
history of these institutions, therefore, is relevant to the history of gender. 

Central to the theoretical debates of concern to feminist scholars are the 
ways in which a gendered division of labor has been constructed and main- 
tained. Here the historical literature has been particularly helpful, pointing out 
variation over time in the meaning of masculine and feminine, in the ways 
that women’s and men’s work has been organized, and in the power to sustain 
or change these meanings and divisions of labor. In analyzing how the social 
organization of gender identifies, structurally and ideologically, some work as 
women’s work and other work as men’s, debates within feminist theory have 
given a new meaning to the concept of social reproduction. 


Social Reproduction 


Writing on the gendered division of labor, feminists use social reproduction to 
refer to the activities and attitudes, behaviors and emotions, responsibilities 
and relationships directly involved in the maintenance of life on a daily basis, 
and intergenerationally. Among other things, social reproduction includes 
how food, clothing, and shelter are made available for immediate consump- 


'For the most part, the current historical literature focuses on the late eighteenth through the 
early twentieth centuries, primarily in Britain and the United States and the synthesis presented 
here reflects the histories of those times and places. The question of how this synthesis might 
change as the available literature expands both temporally and geographically is touched on 
briefly at the end of this essay. While space limitations make it impossible for us to provide full 
bibliographic access, particularly to historical materials that predate the current decade, such 
access is available through the works we do cite. 
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tion, the ways in which the care and socialization of children are provided, the 
care of the infirm and elderly, and the social organization of sexuality. Social 
reproduction can thus be seen to include various kinds of work—mental, 
manual, and emotional—aimed at providing the historically and socially, as 
well as biologically, defined care necessary to maintain existing life and to 
reproduce the next generation. And the organization of social reproduction 
refers to the varying institutions within which this work is performed, the 
varying strategies for accomplishing these tasks, and the varying ideologies 
that both shape and are shaped by them. 

This definition of social reproduction needs to be distinguished from its 
other meanings. Social reproduction has also been used to signal the social 
nature of procreation and/or population processes in general; one of its 
meanings, therefore, is demographic. Feminists would agree that procreation 
is a social, not merely a biological, event, but they argue that while procrea- 
tion is a key component of it, the work of social reproduction includes much 
more than procreation. Another definition of social reproduction refers to the 
perpetuation of entire social systems, that is, to societal reproduction. In its 
most common usage, within Marxist theory, social reproduction refers to the 
perpetuation of modes of production and the structures of class inequality 
inscribed within them. 

The feminist usage of social reproduction draws on and challenges this 
approach. First, while beginning, as does Marxist theory, with necessary 
labor, the feminist concept of social reproduction broadens its definition to 
include the work of maintaining existing life and reproducing the next genera- 
tion. It therefore involves more than production as Marxist theory has defined 
it. Second, feminists who use the concept of social reproduction do so in order 
to understand the perpetuation and reproduction of systems of gender inequal- 
ity, in relation to but different from the reproduction of systems of class 
inequality. In the feminist argument, renewing life is a form of work, a kind 
of production,.as fundamental to the perpetuation of society as the production 
of things. Moreover, the social organization of that work, the set of social 
relationships through which people act to get it done, has varied widely and 
that variation has been central to the organization of gender relations and 
gender inequality. From this point of view, societal reproduction includes not 
only the organization of production but the organization of social reproduc- 
tion, and the perpetuation of gender as ‘well as class relations. 

Using the concept of social reproduction in this way allows us to ask new 
questions and to view old distinctions in new ways. For example, viewing 
social reproduction as work, feminists have argued that in capitalist societies, 
the unpaid domestic work of women converts wages into means of subsis- 
tence. Without it, wage workers could not continue to engage in productive 
work (Lamphere 1987b; Luxton 1980; Smith 1987). This analysis of social 
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reproduction as work contests the analysis of Talcott Parsons three decades 
ago, (Parsons & Bales 1955:16) “that the functions of the family in a highly 
differentiated society are not to be interpreted as functions directly on behalf 
of the society, but on behalf of personality.” 

In addition, although the work of social reproduction has-historically been 
done within families, feminists argue that the family ought to be understood 
as simply one possible institution for accomplishing these tasks. Indeed, 
variations in the distribution of this work between family, market, community 
and state—and between women and men-—-are central questions within femi- 
nist theory and continue to be examined, both theoretically and empirically. 
In the most recent scholarship the accent is on variation and human agency, 
and particularly on the ways in which women have constructed their own 
worlds of activity. (See, for instance, Cott 1987; Evans 1989; Gordon 1988; 
Stansell 1986; and Tilly 1988). As space permits, we indicate some of the 
current debates within this literature. 

This approach differs considerably from modernization theory, the major 
‘sociological paradigm concerned with changes in the institutional location of 
the tasks of social reproduction. Modernization theory tends to treat the 
gendered division of labor as an inevitable consequence of biological differ- 
ences in procreation. It assumes that historical changes in the organization of 
the family, child rearing, etc, are mandated by system imperatives or automa- 
tic structural processes such as differentiation and/or specialization. But, as 
we have argued elsewhere (Brenner & Laslett 1986), the organization of 
social reproduction is determined, at least in part, by political, economic, and 
gender struggles—by how well, and with what ends-in view, families as 
members of social classes have organized to ease their responsibility for 
dependents and by how well, and with what ends in view, individuals as 
gendered actors have organized to restructure this gendered responsibility. 

The scope of this essay is limited in various ways. We do not consider the 
theoretical literature that discusses the concept of social reproduction from a 
feminist perspective, nor the relevant empirical literature on contemporary 
societies. Rather, the remainder of this paper reviews the recent historical 
literature, beginning with two concepts central to it: family strategies and 
separate spheres. > 


Social Reproduction and Family Strategies 


The social reproduction approach connects with the now broad literature on 
family strategies in several ways. “Family strategies” refers to the decisions a 
family makes about the size and composition of the household and. the 
distribution of its labor force within the household and between domestic and 
nondomestic institutions. (See Goldin 1981 as one example of: the micro- 
economic approach to family strategies in historical research; for other ap- 
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proaches, see Morawska 1985 and Lamphere 1987b). Decisions about the 
timing of marriage, number of children, coresidence of nonkin and extended 
kin, and about which family members should work outside the household or 
within it, are understood to be strategic because they are made with particular 
ends in view—for example, to increase the family’s resources or to provide 
for parents’ old age. Feminist scholarship focuses particularly on the gendered 
character of how families distribute labor within the household and between 
domestic and nondomestic spheres in order to provide goods and services 
necessary for social reproduction (Lamphere 1987b). Both the family strat- 
egies and social reproduction literatures see families not as passive agents of 
sociohistorical transformations but as actors consciously working toward their 
own goals. And both approaches argue that family strategies have wide- 
ranging demographic and economic implications for the society as a whole. 

Whereas the microeconomic approach to family strategies tends to treat the 
family as a unit, attention to gender immediately raises the question whether 
family decisions and decisionmaking equally represent the needs and interests 
of women and men. An intense empirical and theoretical debate in the 
literature on the nineteenth century working-class struggle for a “family 
wage” for men, for example, argues forcefully that family strategies are as 
much shaped by the organization of gender, including gender inequality. 
inside and outside the family, as they are by class and race relations and 
historical development. (On the impact of class, see Brenner & Laslett 1986; 
on the impact of race on working class family strategies, see Glenn 1985). 
Further, where much of the family strategies literature has tended to analyze 
changes in the organization of social reproduction as an aggregation of 
microlevel decisions by individual families, the historical literature we review 
has illuminated the relationship between women’s and men’s participation in 
collective action—in trade unions, in political and social movements—and 
the strategies that families pursue (Laslett 1981). 

Feminist thinking about the relationship of women to social reproduction 
treats as problematic what heretofore has tended to be regarded as un- 
problematic: the gendered division of labor, the shifting boundaries through 
which social reproduction has been defined as a private or a public 
responsibility, the commodification (and decommodification) of the work that 
it involves (see Glazer 1984). In analyzing variation and change in these 
areas, research on nineteenth and early twentieth century Europe and North 
America has focused on the emergence, institutionalization, and reorganiza- 
tion of the male breadwinner/female housewife family. In what follows, we 
present an overview of this scholarship and argue (a) that from the late 
eighteenth through the early twentieth centuries, changes in the organization 
of social reproduction, including the family system, have to be seen as the 
outcome of conflicts between social groups with competing interests, and (b) 
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that central to these conflicts was the question of woman’s power and 
authority in the home and outside of it. When we look not only at what men 
did and why, but at what women did and why, paying attention to differences 
of class and race/ethnicity, feminist scholarship has demonstrated the ways 
that women both shaped and were shaped by the organization of social 
reproduction. 


SOCIAL REPRODUCTION, GENDER, AND SEPARATE 
SPHERES 


In both Europe and North America, the notion that men and women inhabited 
separate spheres, governed by opposed but complementary norms and values, 
has been central to the organization of gender in modern industrial society. 
The turn of the nineteenth century saw the emergence of separate spheres as 
an ideology, and throughout the century separate spheres were institutional- 
ized as the culturally and socially hegemonic form of gender relations. By the 
beginning of the twentieth century, both the ideology and actuality of separate 
spheres began to be reorganized. (See Quataert 1985, however, for an earlier 
timing in Central Europe.) These developments are tied to changes in the 
organization of both production and social reproduction. 

In the preindustrial economy based on household production, home and 
commerce, social reproduction and production, women and men, children 
and adults, were located in the same world of daily experience. As property- 
holders and legal representatives of their households, men directed household 
labor and monopolized political, religious, and domestic authority. Fathers 
had broad responsibilities toward children, including religious and moral 
education and training for work, especially, although not exclusively, for sons 
and male apprentices. While also taking primary responsibility for the daily 
physical needs of household members, caring for young children, and nursing 
the sick, mothers made crucial contributions to the productive activity of the 
household. 

In the domestic economy, men and women could regulate their productive 
labor in workshop or field to provide for the household’s daily needs— 
nursing infants, supervising young children, apprenticing older ones. (Medick 
1981, Jensen 1986, 1988; for Black family and work patterns under slavery, 
see Jones 1985). The development of industrial capitalism fundamentally 
undermined the unity between production and social reproduction that had 
previously existed. Losing control over productive property and the work 
process to capitalist employers, families lost the capacity to coordinate pro- 
ductive and reproductive tasks. It is important to recognize that the rise of 
industrial production per se did not create the conflict between the demands of 
production and those of social reproduction. While coordinating the two kinds 
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of labors would take a different form in an industrial economy—on-site child 
care, dinners provided at work and school, flex-time, reduced work hours for 
parents, etc—the technical requirements of industrial production do not rule 
this out. It was: not the rise of industrial production, but its class character, 
which posed the difficulty, i.e. the pressure toward accumulation which took 
on a particularly unregulated form in the nineteenth century. Although the 
impact of the market on the household varied by class, by race, and by 
economic sector, families-faced a new set of constraints as they pursued the 
tasks of securing subsistence and caring for family members. 


The Emergence of Separate Spheres in the Bourgeois Family 


The new propertied class of industrial entrepreneurs and the new middle class 
of independent professionals and salaried employees were the first to develop 
a family household, defined by its difference from the world of production.” 
(Davidoff & Hall 1987, Hausin 1981, Smith 1981, Ryan 1981). Men were no 
longer the organizers of family labor but were instead the providers of income 
on which the household survived. Women took increasing responsibility for 
the education and training of children but did not contribute to family enter- 
prise. Along with this change in household organization, ideas about 
childrearing, family relationships, and the characterological differences be- 
tween men and women were transformed. The home was defined first, as a 
haven from the uncertainties, commodification, and calculation of com- 
mercial life and second, as the locus of social and personal. morality. Maternal 
love and feminine virtue replaced patriarchal authority and religious doctrine 
as. the basis for childrearing. The ideal of the moral mother and the claim that 
it was no longer men but women who were endowed with ethical superiority, 
were embodied in beliefs that celebrated women’s piety, purity, and 
domesticity. Men, in contrast, were seen as aggressive, competitive, sexual. 
(Although this essay privileges the perspective of women, about whom most 
of the recent literature has concentrated, it is of course the case that images of 
masculinity were also affected by changes in class and gender relations in the 
nineteenth century. The literature on men as gendered, historical actors is just 
beginning to be available. See Kimmel 1987, Rotunda 1983.) 

Although motherhood had always been a core feature of femininity, in the 
nineteenth century motherhood came almost exclusively to define middle- 
class women’s activities, while mother love. took on a sacred quality. This 
“cult of true womanhood” was constructed by middle-class women as well as 
imposed upon them. Indeed, Lewis & Lockridge (1988) suggest that defining 


?A literature on upper class women and families in the late eighteenth and nineteenth centuries 
is just beginning to emerge, particularly in studies on the history of education and socialization. 
Such a literature will make it possible to understand more fully the relationship of class to the 
ideology and institutionalization of separate spheres in the nineteenth century. 
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themselves as fragile was one of the ways gentry women in late eighteenth 
and early nineteenth century Virginia were able to exercise some control over 
the number of children to whom they gave birth. 

Overwhelmingly, married women in middle-class households did not work 
for wages outside the home. But even when husbands and wives both wanted 
to pursue careers after marriage and their efforts were supported by the state, 
as was the case for “dual career” teaching couples in late nineteenth and early 
twentieth century France, it was extremely difficult to do so, in large measure 
because of women’s low wages and blocked career opportunities (Moch 
1988). 

From the early nineteenth century in the United States, urban middle-class 
women argued for their participation in the public realm in the name of their 
maternal duties (Hewitt 1984; Ryan 1981). Allying themselves with ministers 
and moral reformers, they established voluntary organizations (mothers’ asso- 
ciations, moral reform societies, etc) through which they sought to promote 
their claims for greater power. The “cult of true womanhood” served to 
legitimate women’s demands for higher status and greater control over chil- 
dren and household at the same time that it justified women’s exclusion from 
the economy and reinforced their dependence on men. Whether or not the 
nineteenth century reorganization of gender relations represented a loss or 
gain for middle-class women is a subject of some debate in the literature 
(Degler 1980:150). While it is true that the consequence of the new gender 
ideals was to solidify women’s identification with domesticity, it is also true 
that as the nineteenth century progressed, middle-class women used the dual 
spheres ideology to justify broad interventions in the public realm and to 
further challenge male prerogatives within the family. (For an account of how 
black middle-class women reformers differently adapted the dominant gender 
ideology to their own purposes, see Aptheker 1982, Giddings 1984). 

In attempting to make the home their own domain, middle-class women 
engaged in a range of social movements and campaigns that drew on and 
helped to institutionalize a new construction of male and female sexuality. In 
social purity campaigns, they used the belief in women’s “passionlessness” to 
argue that prostitutes could only be unwilling victims of men and to demand 
married men’s sexual fidelity and restraint. Men had a right to their authority 
oyer women only if they used it to protect rather than exploit. (See Pleck 
1987, Gordon & DuBois 1983 for the United States; Walkowitz 1983 for 
Britain; Evans 1976 on prostitution reform in Germany.) 

Temperance crusaders in the United States called on beliefs in male brutish- 
ness and female purity to gain a sympathetic hearing for the drunkard’s wife. 
While ostensibly campaigning to protect working-class women from drunken 
working-class men, middle-class women implicitly challenged all men’s right 
to beat or sexually assault their wives. Although only a feminist minority of 
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temperance reformers supported women’s right to divorce, by the end of the 
nineteenth century many in the United States were calling for women’s right 
to separate maintenance from husbands who sexually or physically abused 
them (Pleck 1987). Campaigns for “voluntary motherhood” in America sim- 
ilarly emphasized women’s asexuality in arguing that women did not have a 
duty to submit to their husbands’ demands for intercourse (Gordon 1982). 


Separate Spheres and the Working Class 


Marital relationships, motherhood, and sexuality were experienced differently 
for nineteenth century women in the working class. Working-class men and 
women did increasingly inhabit separate spheres, but unlike middle-class 
wives, working-class women’s sphere included a range of activities clearly 
crucial to the family’s economic survival. These women worked in a homoso- 
cial context—sharing networks of women kin and neighbors and laboring in 
predominantly female workplaces. Working-class marriage appeared to be 
more an exchange of complementary labors than an exchange of material 
support for nur.urance and emotional services (Ross 1983, Jones 1985). 

Working class women’s sexual behavior also departed from the middle- 
class model of sexual purity. Young unmarried working girls in America, 
both earlier (Stansell 1986) and later (Peiss 1986) in the nineteenth century, 
used their sexuality to negotiate with men for promises of marriage, for 
clothes, meals, or other economic benefits, and also just for recreation. As 
Peiss (1986:110) puts it, 


Working-class women received conflicting messages about the virtues of virginity in their 
daily lives. Injunctions about chastity from parents, church, and school might conflict with 
the lived experience of urban labor and leisure. . ... Women who had steady male friends 
they intended to marry might justify premarital sexual intimacy. . . . Other women fully 
bought into the culture of treating, trading sexual favors of varying degrees for male 
attention, gifts, and a good time. 


Although varying by time and place, working-class and immigrant house- 
holds could not achieve the same kind of polarity between the home and 
workplace that defined separate spheres in the middle class. Even by the end 
of the nineteenth century only highly skilled and unionized male workers 
earned sufficient wages to be the sole providers for their families. And the 
male breadwinner family has never been dominant within the black communi- 
ty (Jones 1985; Lebsock 1984). Where white women were increasingly 
defined by their maternal/domestic roles, the larger society defined racial/ 
ethnic women predominantly as laborers (See Glenn 1985; Albers & Medi- 
cine 1983). 

In the working class, a good wife was not only an efficient manager and a 
skilled domestic worker, she also contributed to family income (Tilly & Scott 
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1978, Quataert 1986, Franzoi 1985, Jensen 1980, 1988, Levine 1984, 
Morawska 1985). However, married women worked in ways increasingly 
different from their husbands and their older children who sought full-time 
waged work. Married women’s employment—as home workers (which was 
quite extensive, particularly in Europe), seasonal] and casual laborers, taking 
in boarders, etc.—was definitively shaped by their domestic responsibilities. 
Daughters, like sons, contributed to the family economy, but with the ex- 
pectation that they would withdraw from full-time work outside the home 
with marriage or the birth of their first child (Lamphere 1987b). With the 
exception of domestic service, this pattern also characterized urban black 
women’s work, although black adult women were more likely than white 
women to work for wages (Jones 1985), while Mexican-American women 
and Chinese-American women worked as family laborers in agriculture and in 
marginal businesses. (Glenn 1985) 

Working-class men :may not have been sole breadwinners, but they were 
primary earners. While working-class women pooled resources through. kin 
and neighborhood networks, most single mothers were abjectly poor (Ross 
1986, Stansell 1986). And even by the late nineteenth century, particularly in 
the United States where public services and housing availability were expand- 
ing, and thus the conditions under which domestic labor was done had 
improved (Wright 1981), married women who worked full-time carried a 
tremendous workload (Roberts 1982, 1986; Jones 1985). 

Although working class men had fewer economic and political resources 
than did middle-class men to legitimate their claims to domestic authority and 
paternal power, however little they earned, their wives earned less. And 
although working-class women had fewer resources than middle-class 
women, they also used a version of separate spheres ideology to challenge 
their husbands’ power. The poorest working-class London families, Ross 
(1982) reports, experienced high levels of conflict between husbands and 
wives over men’s unwillingness or inability to provide. Women appealed for 
help from neighbors and took their husbands to court in the name of their 
rights (as wives and mothers) to their husbands’ wage. Stansell (1986) 
provides a similar account for working class New York families in the early 
nineteenth century. 

Whatever the reality of most families, the male breadwinner ideal appears 
to have become firmly established within the working class (Levine 
1987:160-91). From midcentury, the demand that men should earn a family 
wage (a wage sufficient so that neither their wives nor children had to work 
for wages) was expressed over and over again in English.and American 
working-class movements for trade union organization, protective labor 
legislation, and political reform. There is a wide debate in the literature over 
how to explain this development (Barrett 1980, Brenner & Ramas 1984, 
Alexander 1984, Rose 1986, Seccombe 1986b). Attention has focused partic- 
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ularly on the conflict between men and women workers, in which capitalist 
employers attempted to use young single women and married women 
homemakers to undercut men’s wages and to gain control over the labor 
process. Men generally, although not always (see Baron 1982; Taylor 1983, 
Sowerwine 1983), responded by attempting to bar women from their employ- 
ment and secure a family wage for themselves, rather than organizing the 
women and demanding equal wages for men and women. (Rose 1988; 
although see Jensen 1986 for a different dynamic in the French case.) 

Whatever the differences in interpretation, these analyses have demon- 
strated that working-class meanings of masculinity and femininity were cre- 
ated by men and women in conflict with each other and in shifting alliances 
over against capitalist employers. Gender as both relationship and identity 
was socially constructed in the workplace and the economy, not only in the 
family. In turn, gender relationships and identities shaped men’s and 
women’s activity as workers and trade unionists in the United States and 
Britain as they attempted to solve their everyday problems of survival (Milk- 
man 1979, Cockburn, 1983, Taylor 1983, Rose 1987). How men and women 
understood their needs and interests as gendered beings. reflected not only the 
culturally hegemonic middle-class family ideal, but also the strategies they 
adopted in trying to provide for themselves and their children (Stansell 1986). 
Strategic choices, however, were shaped by social, political, and economic 
constraints—most centrally, by capitalist class power and men’s greater 
access to political resources, their advantaged position in the public realm 
(Brenner & Laslett. 1986; Seccombe 1986b). 

By the end of the nineteenth century, the lineaments of the separate spheres 
doctrine, and the forms of gender relations that it expressed, were in- 
creasingly questioned. Just as the turn of the nineteenth century had witnessed 
the emergence of. separate spheres in the United States, the turn of the 
twentieth century in America saw its transformation. As female sexuality, 
motherhood, and marriage were redefined, women remained tied to children 
and family, but in anew way. These new constructions of gender were part of 
the reorganization of social reproduction—most importantly, the decline in 
average family size (especially in the middle class), the decline of household 
production, the rise in real wages and the expansion of consumption through 
the market (also predominantly in the middle class), and the shift of some 
responsibility for dependents from the family to the state. In the rest of this 
essay we review the contribution.of the new historical research in delineating 
and explaining this development. These materials suggest (a) that in contrast 
to both modernization and Marxist theory, the reorganization of social 
reproduction was due to sex as well as economics, (b) that women’s pursuit of 
their own, gendered interests was part of this process, and (c) that the 
institutionalization of these changes resulted in- new constraints as well as 
opportunities for women. 
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GENDER AND THE REORGANIZATION OF SOCIAL 
REPRODUCTION 


Procreation 


Historical demographers have described and attempted to explain the decline 
in fertility that accompanied the onset of industrial capitalism in Europe and 
the United States. (See, for instance, Coale & Watkins 1986, Easterlin & 
Crimmins 1985.) Although there are debates about the timing of fertility 
control within marriage and about explanations that consider the changing 
value and/or cost of children and processes of cultural diffusion, the family is 
usually depicted as the decision-making unit. Whether men and women had 
similar intentions and were equally able to put them into practice is rarely 
considered (Seccombe 1986a). Yet, as Lewis & Lockridge (1988) show in an 
analysis of letters and diaries written by Virginia gentry between 1760 and 
1830, women and men had different fertility “strategies.” Women feared most 
the pain and possibility of death associated with pregnancy and childbirth; 
although concerned about their wives’ well-being, men were likely to agree to 
family limitation only when their wives had entered their thirties and had 
produced a male heir or two. The evidence is clear about one thing: A desire 
for family limitation was voiced first among women and it was only after men 
as well as women defined pregnancy as a threat to women’s health that a 
decline in fertility can be observed. 

In an analysis of working-class British women’s letters and survey results 
about their reproductive behavior in the late nineteenth and early twentieth 
centuries Seccombe (1989a) shows that, these women had a concern about 
health remarkably similar to that of their early nineteenth century American 
sisters. Of course, in working class families, economic concerns were also 
central to family limitation in ways that are less evident among the Virginia 
gentry a century earlier. Nevertheless, in both studies, the route to fertility 
control involved negotiations with their spouses over the place of sexuality in 
their relationships. Although limited in coverage (Seccombe 1989b also uses 
data for Norway and Germany), these studies suggest that part of the reason 
fertility decline appeared first in the middle class was that middle-class 
women had more resources and authority in their marriages than did working- 
class women. The studies also clearly suggest the need for further research 
that can determine the intentions and possibilities of the historical actors 
whose behavior created the demographic transition. 


Sexuality 


While married women were limiting their childbearing, thus eliminating one 
of the major barriers to their increasing participation in public life, many 
middle-class women were not marrying at all. These highly educated “new 
women” of the 1880s and 1890s appealed to beliefs about women’s special 
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capacity for service to defend their participation in social welfare, political 
reform, and new professions (Evans 1989: ch. 7; Smith-Rosenberg 1985:245- 
96; Cott 1987). Many were outspoken feminists who established homosocial 
institutions (schools, colleges, settlement houses, etc) and life-long networks 
with other women that provided friendship, emotional support, and in some 
instances, erotic intimacy (D’Emilio & Freedman 1988:189~—194; Sklar 1985; 
Vicinus 1985). Since within the cult of true womanhood, nurturance, not 
sexuality, was the essential characteristic of womanliness, the new women’s 
preference for good works and “single blessedness” did not undermine their 
femininity. They represented an attractive alternative to marriage and mother- 
hood for economically independent middle-class women. 

By the beginning of the twentieth century, changing sexual attitudes and 
social beliefs were redefining the “new woman.” The new science of sexology 
created a discourse through which women could claim sexual pleasure—but 
only within the confines of heterosexual relationships (Smith-Rosenberg 
1985:245-296, Gordon & DuBois 1983). Sexuality was increasingly sepa- 
rated from procreation and was characterized as a means of pleasure and 
personal growth, of personality and energy, for women as well as men 
(D’Emilio & Freedman 1988). While women were given the right to sexual 
enjoyment, however, marriage and heterosexuality were increasingly defined 
as expressions of women’s maturity, happiness, and mental health. 

Both for the women of this generation who became public figures in the 
bohemian worlds of Greenwich Village, Paris, and Berlin and for those who 
lived more domestic lives within the privacy of their own families, sexual- 
ity—heterosexuality—had become more central to what it meant to be a 
woman (Grossman 1986; Trimberger 1983). In the hands of the male sexolo- 
gists, the politically active, career-oriented, unmarried new women of the first 
generation became “unnatural” and selfish, while the spinster and mannish 
lesbian were defined as perverse, as the embodiment of sexual pathology 
(D’Emilio & Freedman 1988: 121-130; Jeffreys 1985). Female autonomy was 
now defined in sexual not economic or political terms. As Smith-Rosenberg 
(1985:283) puts it: l 


Linking orgasms to chic fashion and planned motherhood, male sex reformers, psycholo- 
gists, and physicians promised a future of emotional support and sexual delights to women 
who accepted heterosexual marriage——and male economic hegemony. 


The young working-class women who peopled the city’s “cheap amuse- 
ments” exchanged their sexual favors for treats (Peiss 1986) and sought sexual 
pleasure outside of marriage were labeled deviant “hypersexuals.” (In con- 
trast, young black women’s sexual assertiveness was considered an expres- 
sion of the “natural immorality of the race’—-Lunbeck 1987). Young women 
who engaged in premarital intercourse out of poverty or with promises of 
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marriage could be seen as victims, while those who appeared to choose to do 
so, without marriage in mind, were expressing a sexual assertiveness and 
social autonomy that was considered appropriate for men only (Lunbeck 
1987). 


Consumerism 


Companionate marriage recast nineteenth century notions of sexual difference 
by constructing marital relations as an emotional partnership between in- 
dividuals presumed to have similar interests and needs. Similarly, mother- 
hood was redefined as a career to which women were naturally suited but for 
which they had to be educated. Women’s service to men and children in the 
family was defined no longer by a discourse based in religion and morality, 
but by one in psychology and mental health (Cott 1987). These changes were 
linked to the reorganization of women’s domestic labor in this period, most 
strikingly the shift from home-production to consumption and to new ideas 
about the home and housing. 

From the 1890s onward in the United States, middle-class women wanted 
simpler living space that would give them the time to pursue their myriad 
interests outside the home: in clubs and philanthropic work, in offices and 
department stores. A new style of home, the bungalow, met this demand 
(Wright 1981:158-76). And family-oriented housing for the working-class 
was also becoming available in company towns, planned communities, and 
the developing suburbs (Wright 1980, 1981:177—214). The home envisioned 
for both classes reiterated the male breadwinner/dependent wife and children 
family in which model homes would be matched by model housewives trained 
according to the tenets of home economics (Wright 1980:150—70). 

Earlier writings on the woman as consumer have seen her as a creature of 
advertising, the growth of retailing, and in general as capitalist producers’ 
attempt to create and control expanded markets in both the working and 
middle classes (Ewen 1976). Recent research which takes a more nuanced and 
focused look at the ways consumption was organized and shaped at the turn of 
the twentieth century-—particularly recent work on the department store—has 
viewed this development somewhat differently (Benson 1986; Leach 1984). 
Sales by mostly women salespersons, management by usually male manag- 
ers, and service to mostly women customers created a complex interactional 
stage on which class and gender relations were played out. This perspective, 
which sees the outcome as a result of negotiations between actors with 
sometimes similar, sometimes contradictory goals, suggests that department 
stores became a new kind of public space for both middle- and working-class 
women. Although, in the first instance, department store appeals were made 
more to middle-class women with discretionary income than to working-class 
women, the principle of free entrance without an obligation to buy and the 
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development of special sections for women with less money (the bargain 
basement) expanded the culture of consumerism to the working-class as well. 
Department stores became like museums of the modern, fashionable life, 
open without charge to all women (Leach 1984). 

Consumerism shaped women’s understanding of the new individualist 
demand for greater sensual enjoyment but was itself shaped by the practices of 
women—women customers and women salespersons—within a new business 
culture. In shopping, women were not simply victims of capitalist advertising 
but were taking an opportunity for autonomy and personal expression. 
Although shaped by both class and gender relations, the lure of consumerism 
reflected an emphasis on pleasure rather than duty in the pursuit of personal 
meaning and family responsibility. The nineteenth century version of domes- 
tic science, which.taught women how to be frugal housewives and careful 
mothers responsible for the health of their families was now joined by the lure 
of fashion and the belief that the consumer of the day had to be a “trained 
purchasing agent” (Benson 1986:79). 

The “new woman” was an ideal attainable by a very small minority at the 
turn of the century. Most working-class women were oppressed -by large 
families and small incomes. Reformers were concerned not so much about 
women’s burdens but about the adequacy of childcare which was, in turn, 
linked to concern about the size and quality of both the labor force and the 
fighting force, and about crime, delinquency, and radicalism (Jenson 1986; 
Gordon 1988). Despite women’s political activism in the context of the 
campaign for suffrage, which was occurring throughout this period, working- 
class women’s difficulties in accomplishing the work of social reproduction 
were discussed as individual rather than social problems that required no 
change in relations of economic dependency betweén women and men. 
According to women professionals in government agencies such as the Chil- 
dren’s Bureau and the Women’s Bureau, the best solution to the poverty that 
forced working-class women into the labor force was to increase men’s wages 
(Cott 1987:206). 

Rapp & Ross (1986) argue that the new consumer culture of the 1920s was 
part of the political backlash that undermined the feminist achievements of the 
first generation of new women on behalf of suffrage and political reform in 
general. Rather than specifically antifeminist, the early twentieth century 
culture coopted many feminist issues by linking personal identity and fulfill- 
ment with companionate marriage, heterosexual pleasure, motherhood as a 
career, and consumerism. As they put it, “life-style feminism supplanted its 
activist predecessor” (Rapp & Ross 1986:59). At the same time, however, the 
reorganization of social reproduction at the beginning of the new century 
institutionalized its own contradictions—articulating women’s entitlement to 
equality with men but then denying them the possibility of achieving it in 
practice. 


396 LASLETT & BRENNER 


Professionalization 

Just as the redefinition of motherhood and marriage responded to and con- 
tained women’s aspirations for equality, so too did the creation of gendered 
professions. The struggles to control areas of expertise and delivery of service 
not- only gendered the occupational structure but also helped to expand public 
responsibility for social reproduction. Medicine provides a particularly vivid 
example of the process. (For a broader historical discussion of women and 
professionalization, see Cott 1987, Chap. 7, and Glazer & Slater 1987.) 

Although doctors were primarily men, some aspects of doctoring were 
women’s work. Pregnancy and childbirth, in particular, had traditionally been 
under the care of midwives and/or women neighbors. In addition, in 
nineteenth century America a variety of medical reform movements tried to 
challenge the hegemony of elite-trained doctors over health care practices. 
The “professionalization” of medicine involved the successful reassertion that 
elite-training was central to medical work, and thus both women and health 
reform advocates were effectively excluded from its practice. Indeed, Smith- 
Rosenberg (1985:217—244) argues that doctors’ goal of professional hege- 
mony was one of the reasons for their campaign against abortion in the United 
States between 1850 and 1880 (see McLaren 1985 for England). 

The loss of access to abortion was particularly damaging for women since, 
as Seccombe (1989a) points out, women could initiate abortion without the 
co-operation or, indeed, the knowledge of male partners. As medicine be- 
came increasingly professionalized throughout the nineteenth century, the 
traditional association of female practitioners with childbirth and abortion was 
severely undermined. Yet the change was not wholly negative for women; 
Seccombe (1989a) also suggests that the medicalization of childbirth in the 
latter part of the nineteenth century may also have provided working-class 
wives with some authority that could be used to convince their husbands to 
exercise sexual restraint (as appears to have been the case among the Virginia 
gentry, see Lewis & Lockridge 1988). 

Male control over all aspects of medical practice in the United States was 
accomplished, in the end, by a reorganization of medical education at the 
beginning of the twentieth century. In the nineteenth century, women who 
wished to become doctors were trained almost exclusively in single-sex 
schools—the only avenue open to them since they were, by and large, denied 
entrance to men’s medical schools. The demand for more scientific medical 
education at the beginning of the new century accounts for the decline in the 
number of women in medicine in the United States in the first quarter of the 
twentieth century; few women were admitted to the professional training 
institutions that offered the new educational requirements. For those women 
who did become doctors, their inability to find places in private medicine led 
them to seek other outlets for their talents and interests. The development of 
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public health as a field in part reflects the energies and efforts of women 
doctors within it (Morantz-Sanchez 1985). 

The gendered history of the professionalization process is important for 
understanding the relationship between gender and social reproduction in the 
twentieth century in two ways. First, women’s exclusion from male fields as 
they became “professionalized” limited their activities to work that could, in 
some sense, be claimed as part of women’s domain. It was, thus, women in 
home economics, in public health, in social work, in government welfare 
agencies, (see, for instance, Ladd-Taylor 1986) that helped redefine in 
scientific terms the importance of women’s continued responsibility for child 
care and household maintenance. Secondly, the increased efforts of men to | 
limit the number of women entering the professions (through, for instance, 
setting quotas on the number of women admitted to medical schools— 
Morantz-Sanchez 1985: 248) were not resisted by a second generation of new 
women among whom a feminist consciousness was in decline. Thus, pro- 
fessional women in the 1920s in some measure legitimated the in- 
stitutionalization of the professions as a sphere of male authority and activity. 


The Role of the State 


The expansion of public responsibility for social reproduction in England and 
America occurred first through the development of charitable and government 
institutions which took over aspects of childrearing through compulsory 
education. More generally these assumed responsibility for children through 
orphanages, foster homes, reformatories, and the courts. The expansion of 
government responsibility for social reproduction can also be seen in the 
development of healthcare institutions such as hospitals and public health 
programs (Morantz-Sanchez 1985:282-315). In addition, home visitors from 
child welfare agencies, truant officers, juvenile court probation officers, as 
well as legislation against home work, ordinances against boarders, etc, 
expanded state control over the physical, cultural, and social conditions under 
which the work of social reproduction was accomplished in families (Lewis 
1986b). Modernization theory has treated state expansion as an inevitable 
product of industrialism and the inability of families to perform their tradition- 
al functions. Radical theory has emphasized class struggle (especially the 
mobilization of trade unions and political parties) and social control as the 
driving forces in state expansion (Quadagno 1987). Feminist analysis has 
added to the radical critique by demonstrating the importance of gender in 
shaping social policy (Jenson 1986; Pascall 1986). 

From the late nineteenth century through World War I, middle-class 
women played important parts in conceiving and implementing reforms. 
While in the United States, the major expansion of the welfare state occurred 
in the post-World War II period, policy frameworks were set at the turn of the 
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century. Pleck (1987), in fact, argues that the Progressive era in America was 
as important for institutionalizing structures of social policy toward women 
and children as was the New Deal in relation to organized labor and industrial 
policy. Indeed, it might reasonably be argued that the concept of social 
welfare in Britain and the United States was the creation of a century of 
women’s political organizing to expand social responsibility for the well- 
being of women and children (Sapiro 1986) 

But reform efforts were not gender neutral. Protective legislation was 
implemented for women but not men. Contributory, non-means-tested pro- 
grams such as pensions and workmen’s compensation, considered to be 
entitlements, were directed toward men as workers. Non-contributory, 
means-tested programs such as mothers/widows’ pensions, and maternal and 
child health care, understood as public charity, were directed toward women, 
primarily as mothers (Nelson 1989; Lewis 1984). 

State legislation and programs encoded and reinforced gender difference, 
and most crucially, women’s dependence (Sapiro 1986). Turn of the century 
income maintenance programs were structured by the assumption that men 
were or should be breadwinners while women were non-earning mothers; 
English national insurance legislation granted men, but not women, workers 
dependents’ benefits (Lewis 1984). In confronting the poverty of women- 
headed families at the end of the nineteenth and beginning of the twentieth 
centuries, middle-class reformers in Britain argued for mothers’ pensions, 
while at least some working-class women trade unionists and radicals argued 
for day care and minimum wages for women (Lewis 1984). The choice of 
mothers’ pensions reflected, it is argued, both class and gender interests and 
political power. 

Many employers were especially hostile to minimum wage legislation, less 
so to state subsidies, although they would have preferred to have neither 
(Abramovitz 1988). And the male-dominated trade unions were more willing 
to support women as mothers than as workers. (Although, see Jenson 1986, 
for a discussion of differences between French and English unions at the turn 
of the century.) While explanations diverge, most analyses of late nineteenth 
and early twentieth century legislation agree that rather than eclipsing the 
male breadwinner family, welfare state programs helped to maintain it (Boris 
& Bardaglio 1983, 1987; Zaretsky 1982). 

State expansion reflected structures of male domination, not only in the 
ways that policies tended to reinforce women’s economic marginalization and 
dependence on men, but also in the ways that women organized as women 
and used the state to challenge men’s domestic authority and to increase their 
economic resources. For example, through campaigns for state intervention, 
middle-class women carved out alternatives to marriage and motherhood by 
taking up work as providers of services to working-class families. The new 
“female” professions—school teaching, home economics, social work (Gla- 
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zer & Slater 1987), public health doctors (Morantz-Sanchez 1985:292~315) 
and nurses (Reverby 1987:109-110)—were defined by a reference to 
women’s special capacities for nurturance and altruism. 

Programs for ensuring child welfare did increase state surveillance and 
regulation of the lives of working class women (Abramowitz 1988, Vanderpol 
1982, Lewis 1986b). Working-class women often bitterly opposed the efforts 
of welfare reformers to enforce middle-class notions of appropriate childrear- 
ing, household standards, etc (Wright 1980:150-70). These interventions 
often threatened household income, for example, by removing boarders or 
preventing children from working (Gordon 1988; Ross 1986). In the working- 
class neighborhoods of late nineteenth century Boston, the Massachusetts 
Society for the Prevention of Cruelty to Children was notorious as “The 
Cruelty.” But working-class women also used these agencies to defend 
themselves against abusive husbands, to obtain additional economic re- 
sources, and to increase their domestic power. Gordon (1988) demonstrates, 
for example, that in the United States, interventions in families were often 
initiated by women (and children), not by case workers. And, in the interac- 
tion between social workers and their clients, working-class women strug- 
gled, sometimes successfully, to contest and modify the standards that social 
workers enforced. 

State policy outcomes reflect women’s political activity but also the differ- 
ent resources and power available to women and men of different classes and 
racial/ethnic groups. If women tended to support state policies that assumed, 
and in the long run reinforced, women’s exclusive responsibility for children 
and domestic labor, to understand why they did so, we have to consider their 
alternatives. Working-class women’s vulnerability to exploitation in wage 
labor, their marginalization in the trade union movement, the impoverished 
conditions under which homework had to be done, the importance of female 
kin networks in pooling resources for survival, the commitment of potential 
middle-class allies to separate spheres, the apparent inevitability of these 
conditions, left working-class women with little space to articulate strategies 
that did not assume a gendered division of labor. Although middle-class 
women had more room for political participation and more economic oppor- 
tunities than working-class women, their interventions on behalf of working- 
class women reflected their own limitations—-their exclusion from male 
professions and their dependence, for the most part, on husbands’ income. 


CONCLUSION 


Sociological categories have tended to reify the social organization of in- 
dustrial capitalism, reflecting its divisions into private/public, family/politics, 
emotional/instrumental, reproduction/production. In this intellectual uni- 
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verse, the theoretical concerns of family sociology, political sociology, and 
historical sociology rarely intersect, except perhaps in the study of social 
policy. Persistence and change in family organization is understood to be the 
aggregation of individual decisions (as in the family strategies literature), the 
result of system imperatives (as in the modernization literature), or the logic 
of capitalism (as in the Marxist literature). We believe that the concepts of 
social reproduction and gender allow for explanations that are more historical 
and dynamic, that recognize both human agency and social structure. It is in 
this way that history as a process of structuring (Abrams 1982) can be more 
fully understood. 

Recognizing social reproduction as a domain of necessary social labor, and 
gender as a fundamental dimension around which it is organized, focuses 
attention on how the work of social reproduction is distributed between 
women and men within the family and between the family and other in- 
stitutions. This socially constructed division of labor is an arena of social 
conflict whose outcome shapes the political, economic, and ideological con- 
tours of the society as a whole. In this context, family strategies are seen to be 
collective as well as individual; through political parties, trade unions, social 
movements, urban rebellions and food riots, families have attempted to ease 
the burdens of dependence (Kaplan 1982, Laslett 1981, Tilly 1971). More- 
over, families and individuals choose strategies and succeed or fail in them 
under circumstances constrained by both class and gender relations. 

In analyzing either individual or collective strategies, gendered interests 
that reflect gender inequality are central to the structuring process. From this 
perspective, the differential capacity of women and men, as members of 
different classes and racial/ethnic groups, to organize and accomplish their 
goals is understood to affect the range of alternatives available for the work of 
social reproduction, as in the appearance, form, expansion, and retrenchment 
of the welfare state. Further, in recognizing the dependence of both polity and 
economy on the gendered division of labor, and especially on women’s 
unpaid labor in the home, we can better understand the ways in which state 
policies and economic organization more generally reflect gender interests 
and capacities for collective action. 

In demonstrating that gender and the organization of social reproduction 
shape political and economic institutions as well as being shaped by them, the 
historical literature we have reviewed indicates the value of incorporating 
gender into macrosociological theory. In this regard, comparative historical 
research can make a particularly valuable contribution to theories of the 
relationship between gender, social reproduction, and other social domains. 
How, for instance, did the earlier institutionalization of state support for 
social reproduction in Europe than the United States affect the political 
mobilization of women as women in these different national contexts? Did the 
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relatively early decline in French fertility, and the subsequent state policies 
that encouraged women to continue working outside the home after marriage, 
affect the “strategy of solidarity” with women pursued by the French trade 
union movement, compared to the strategy of exclusion followed in England 
and the United States. And in turn, did the different relationship between male 
and female workers in France affect the fortunes of the political left there 
compared to elsewhere? Did the ideology of separate spheres and its in- 
stitutionalization through market or political mechanisms have the same effect 
on social policy in, for instance, France, Germany, Sweden, England and the 
United States? And how can answers to this question inform our understand- 
ing of developments in Eastern Europe and/or Third World countries today? 
How does variation in the history of gender relations inform us about when 
and why some institutions, cultures, and ideologies come to take on sexual 
meanings? Comparative historical research can help answer these kinds of 
questions and, thus, test the universality of our theoretical models. 
Historians tend to focus more on context and specificity while sociologists 
focus more on abstraction and generalization. This tension, however, can be 
extremely fruitful for historical sociologists since theoretical formulations 
developed in one historical instance can be tested and refined in the light of 
historical evidence from others. A similar intellectual contribution has been 
made by feminist scholars. Efforts to understand how gender and the organi- 
zation of social reproduction shaped historical processes provide the empirical 
data for testing ungendered theories of procreation and societal reproduction. 
For this reason, we think the historical research and theoretical approach 
discussed in this review are relevant to wide-ranging sociological interests. 
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Abstract 


As sociological research on China has accumulated over the past decade, 
perceptions of social change after the revolution have altered markedly. This 
essay reviews findings in the areas of economic development, community 
organization, bureaucracy, and social stratification that reflect these changed 
perceptions and that should be of wide interest in the discipline. 


INTRODUCTION 


In the past decade, the study of Chinese society has developed rapidly and has 
changed in fundamental ways. When Chinese studies were last reviewed in 
this publication (Whyte et al 1977), perhaps no more than three English- 
speaking sociologists were actively engaged in research on the country, and 
little published research was distinctively sociological in focus.! The dis- 
ciplined study of social stratification, mobility, and community organization 
had barely begun. A major part of the problem was the paucity of data and 
research opportunities: Work in China was not common until the 1980s, basic 
social statistics were unavailable, Chinese social scientists could not conduct 
independent research, and the mass media were information-poor. 

By the mid-1980s all this had changed. More than 10 active sociologists 


‘Two pioneers of the first generation were not active in China research at that time. Franz 
Schurmann, who authored an intriguing interpretation of party ideology as a component of 
Chinese political organization (Schurmann 1966), had by this time moved on to other interests. 
Ezra Vogel, author of influential studies of personal relations (Vogel 1965), political authority 
(Vogel 1967a), and bureaucratic behavior (Vogel 1967b), had tumed to the study of Japan and 
would not return to China research until the mid-1980s (Vogel 1989). 
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now specialized in China research. Publications multiplied rapidly—the vast 
majority based on work in Hong Kong that continued to come to publication 
into the 1980s.* Chinese government agencies began to collect and release 
comprehensive social statistics. North American scholars were being granted 
research access to different field sites (e.g. Henderson & Cohen 1984; Nee 
1985; Selden 1985, 1988; Siu 1989, Yang 1986, 1987; Zweig 1989). China’s 
social sciences revived, stimulating systematic social surveys and a keen 
interest in western theories and methods. This, in turn, has spawned a number 
of recent collaborative surveys with American China specialists and has 
drawn several sociologists into China research for the first time.* Chinese 
students have enrolled in doctoral programs here, sometimes bringing data 
with them, and have begun to contribute to the discipline (e.g. Lin & Xie 
1988, Xie & Lin 1986). 

These developments have greatly altered both scholarly perceptions of 
China, and the research agenda. Many of the claims made about social 
changes during the late 1960s and early 1970s, which shaped the research 
agenda of that time, have been discredited. By the mid-1970s, the main 
agenda was to document and interpret actual patterns of social change during 
the 1960s and 1970s, and to explain the many unintended consequences that 
flowed from Maoist policy. By the 1980s, China’s economic reforms, quite 
sweeping in rural areas, had pushed another research. agenda to the forefront: 
What would be the effect of a revived market economy and new forms of 
ownership and management on prior patterns of stratification and mobility, 
bureaucratic behavior, and community life? 

Much of this research has wider implications for generic sociological 
questions, but these findings are not yet widely known in the discipline. In 
this review, I seek to identify the most important findings, their broader 
implications, and the evolving research agenda in four subject areas: eco- 
nomic development, community organization, bureaucracy, and social 
stratification and mobility.* 


THE CHINESE PATTERN OF DEVELOPMENT 


In the 1970s, many scholars saw in China an alternative model of develop- 
ment for the third world. Here was a regime that proclaimed its independence 


*The 1970s were the heyday of monographic research based on extended stays in Hong Kong. 
But after 1980, as American researchers went to China for research in increasing numbers, the 
tradition of extended research stays in Hong Kong quickly died out. However, research con- 
ducted in China has not yet resulted in enough publications to have a major impact on the topics 
reviewed here. 

3For example, Peter Blau, Glen Elder, Alex Inkeles, Nan Lin, and Anthony Oberschall. 

_ ‘Limitations of space and in the author’s competence force us to neglect important de- 
velopments in the field of demography, which deserve a full and separate treatment. 
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from the world economic system, forsaking foreign investment and export 
strategies in favor of a course of self-reliance. This strategy produced con- 
sistent industrial growth for over two decades. It appeared to have eradicated 
mass poverty, raised life expectancy rapidly, and lowered rates of infant 
mortality. Moreover, this regime proclaimed its desire to reduce wage dis- 
tinctions, stress moral instead of monetary incentives, reduce income gaps 
between city and countryside, emphasize rural development programs, pro- 
_ vide full employment and extensive job and social security, and strictly 
control rates of urbanization. To many social scientists, some of whom traced 
inequality, poverty, and economic stagnation in the third world to dependence 
on foreign capital, China seemed a promising alternative. 

Some of the accomplishments claimed would later prove to be genuine. 
Increases in life expectancy and lowered rates of epidemic disease were 
indeed achieved rapidly, and in fact attained the levels of other Asian countr- 
ies that had much higher average incomes (Banister 1987, Eberstadt 1988). 
Industry, grain output, and gross national product had all grown at rates that 
put China in the top ranks of the low-income countries (Riskin 1987, Ch. 8). 
Especially in urban areas, there was an extensive social security system and 
nearly complete job security (Davis-Friedmann 1983; Walder 1986, Ch. 2; 
Whyte & Parish 1984). Rates of urbanization were strictly controlled, in part 
through the enforced relocation of secondary school graduates to the country- 
side, and urban unemployment was thereby curtailed (Bernstein 1977, Kirkby 
1985). Through extensive neighborhood organization and contro! of mobility, 
the urban blight and crime typical of cities in much of the world were not 
characteristic of China (Whyte & Parish 1984). 

By the late 1970s, however, this model of development had encountered 
difficulties. Respectable growth rates had hidden some serious and worsening 
problems. As China’s leaders reevaluated their strategy, they released exten- 
sive data about past performance. While grain output doubled after the 1950s, 
so did the population (due to a disastrous rejection by the government of birth 
planning on political grounds) (Banister 1987). Moreover, self-reliance in 
agriculture forced all regions into a grain subsistence economy, which led to 
poor cropping decisions, and which depressed traditional trade and cash crops 
that had earlier brought high incomes to many rural regions (Lardy 1983). As 
a result, per capita grain consumption actually declined slightly over two 
decades for the nation as a whole, As the quantity of food consumed declined, 
its quality deteriorated. Consumption of vegetable oils, fish, sugar, and fruit 
all declined markedly (Lardy 1983). The quality of the diet, as indicated by 
the percentage of caloric intake comprised of grain, dropped to a point that put 
China on a par with Ethiopia and Bangladesh (Eberstadt 1988, Ch. 7). By 
1978, the grain consumption of 20% of the rural population did not meet 
official Chinese subsistence standards; it had probably declined since 1949 
(Lardy 1983). Accomplishments in the eradication of illiteracy and in 
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raising primary school enrollment were about the same as in Indonesia, better 
than in India, but only half as good as in Taiwan and South Korea (Eberstadt 
1988, Ch. 7). 

In cities, real wages declined by almost 20% in the two decades after 1956 
(Walder 1984). Per capita urban housing space declined by a similar percent- 
age, while the existing housing stock deteriorated badly (Walder 1984, Whyte 
& Parish 1984). State policy had destroyed the small-scale private sector, 
leading to an extreme shortage of fresh vegetables and fruit, and of such basic 
services as retail shops, restaurants, and repair services. A wide array of 
foodstuffs and commodities, from pork and matches to radios and bicycles, 
were rationed, of poor quality, and in very short supply (Whyte & Parish 
1984). 

It had become clear that the Maoist strategy was in fact a variant of Stalinist 
forced-draft heavy industrialization. While Chinese peasants were not sub- 
jected to the political terror and forcible requisitions of those in Russia under 
Stalin,’ they were nonetheless exploited systematically in favor of urban 
industry. The prices peasants received for mandatory grain sales were kept 
low, and the prices they paid for manufactured items high (Lardy 1983; 
Riskin 1987, Ch. 10). The cost of basic consumer items, measured in 
quantities of grain, was much higher for China’s peasants than for those in 
other east Asian’ countries, even for the same brands of consumer items 
(Riskin 1987, Ch. 10). The gap in income and food consumption between city 
and country widened considerably. Investment in the rural infrastructure came 
largely from grain extractions and labor levies placed upon villages, not from 
the state budget (Lardy 1983). Though enjoying a privileged position relative 
to peasants, urban residents also suffered from a characteristic neglect of 
wage increases, consumption, and housing in favor of industrial investment 
(Walder 1984). In terms of labor hours, they also paid vastly more for 
consumer items and even many staple food items than did their counterparts in 
Taiwan, Japan, and even the Soviet Union (Klatt 1983). 

This. pattern of development exhibited the inefficiencies of the Soviet 
pattern. Industrial productivity had declined steadily from the 1950s. Output 
per unit of capital invested declined by some 25% from 1952 to-1978. This 
meant that almost all industrial growth was accounted for by the shifting of 
resources from other sectors into industry, and virtually none was due to the 
efficient use of inputs (Riskin 1987, Ch. 11). Overcentralized management, 
irrational investment decisions by bureaucrats, and poor labor motivation 
were the primary reasons for the failures in industry (Riskin 1987, Ch. 11; 
Walder 1986, Ch. 6). 


“In the Great Leap Forward excessive grain requisitions exacerbated a massive famine 
initiated by ill-advised party decisions about cropping patterns and harvesting (Bernstein 1984). 
Subsequent analysis of the 1982 census yields estimates of 20 to 30 million excess deaths from 
1959 to 1961 (Ashton et al 1984). 
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China especially suffered in comparison with its economically dynamic 
neighbors in East Asia. By the late 1970s, not only Japan, but also South 
Korea, Taiwan, Hong Kong, and Singapore had become industrialized and 
relatively affluent societies. Especially relevant was the comparison with 
Taiwan, whose development contradicted earlier theories of dependency and 
underdevelopment. Taiwan developed far more quickly than did China in 
terms of national income and living standards, while it matched or surpassed 
China’s accomplishments in income equality, health care, mortality decline, 
and fertility control (Amsden 1979, Barrett & Whyte 1982, Eberstadt 1988). 
While the Taiwan case contributed to the development of alternatives to 
dependency theory (Gold 1986), the comparison with China convinced many 
that effective state management of relationships with the world economy, and 
not withdrawal from it, was the proper strategy. 

The short-term impact of the post-1978 economic reforms highlights the 
retardant effect of the Maoist model. Since the return to family farming, and 
since the adoption of material incentives, management reforms, and petty 
capitalism in the cities, economic improvements have been striking. Annual 
growth rates for grain and a wide range of other foodstuffs more than doubled 
almost immediately and were sustained for six years. Increasing agricultural 
diversification and per capita output led to a dramatic improvement in diets 
(Riskin 1987, Chs. 11, 15; Lardy 1984). Real incomes have increased by 
more than 65% in cities and have more than doubled in the villages (Walder 
1987; Riskin 1987, Ch. 15). Consumption of basic and even luxury consumer 
goods have multiplied several-fold (Riskin 1987, Ch. 15). From 1978 to 
1986, urban housing construction matched figures for the first 30 years after 
1949 (Walder 1987), and rural housing was built at even higher rates (Riskin 
1987, Ch. 15). While the new strategy has encountered a number of difficul- 
ties, which are currently under investigation, it nonetheless highlights the 
weaknesses of China’s earlier approach. 

In light of the progress made by most other developing countries in the 
region from 1949 to 1979, China’s pattern of development is unenviable. It 
built a large heavy industrial base, but with massive waste and inefficiency, 
sacrificing incomes, light industry, and the country’s infrastructure. China 
made genuine advances in health care, life expectancy, literacy, infant 
mortality, and other indicators of quality of life, but these advances were 
matched or surpassed by many East Asian nations with market economies. It 
is often remarked that China’s accomplishments in this regard look good 
when compared with countries at the same level of per capita income. But this 
attempt to standardize comparisons is misleading, since China’s model of 
development was itself partly responsible for the growing income gap with its 
neighbors. China was able to equalize incomes to a relatively high degree 
within communities, but allowed regional disparities to balloon. Community 
equality did little to help the poorest fifth of the rural population, roughly 20 
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million of whom perished in the man-made famine of 1959-1961. In the late 
1970s this population still lived at bare subsistence levels, with diets that had 
not improved substantially over 30 years. 

Despite its poverty, China had begun the 1950s with a distinct advantage 
over many other developing countries: Its government had unusual organiza- 
tional capacities and a clear political will to implement ambitious develop- 
ment programs at the grass roots. The next 30 years saw that advantage turned 
into a disability as misguided policies held sway. Despite the evident suc- 
cesses of the 1980s, China’s development experience is far from over. In the 
next decade, researchers will be preoccupied with several questions. Will the 
economy be able to sustain the growth of the 1980s, or is it reaching the end 
of the easy gains that followed the rejection of past policies? Can the country 
control inflation, keep its foreign indebtedness in check, and continue to 
guarantee basic subsistence and the welfare of its poorest citizens, or will it 
experience a loss of macroeconomic control, enhanced income disparities, 
and a worsening lot for the poor? It is possible that a mismanaged Chinese 
economy will present the country with problems more intractable than the 
ones of the 1970s which were, after all, rectified with relative ease. 


THE STRUCTURE OF LOCAL COMMUNITIES 


Hong Kong research in the 1970s, relying heavily as it did on intensive 
interviews with emigres, focused overwhelmingly on the organization of 
urban workplaces, neighborhoods,and villages. With considerable unanimity 
this research, and increasingly research in the 1980s based on extended 
residence in China (e.g. Henderson & Cohen 1984; Gold 1985; Yang 1986, 
1987), documented a pattern of community organization characterized by 
limited mobility, broad dependence on workplaces for the distribution of 
goods and services, and extensive networks of personal loyalty, obligation, 
and mutual assistance. The Maoist model of development also had con- 
sequences for community life, generating a new pattern of social organization 
that differs considerably from other modern forms. 

The first striking feature of this pattern was a remarkable lack of residential 
and job mobility. In the 1960s and 1970s a strict system of household 
registration and food rationing tied peasants to their villages and urban 
residents to their neighborhoods (Parish & Whyte 1978, Whyte & Parish 
1984). This system effectively restricted migration to cities and allowed the 
authorities to enforce the relocation of 16 million youths to rural areas. Rates 
of residential mobility in urban China were much lower than in other develop- 
ing and advanced countries (Whyte & Parish 1984). While personnel turnover 
has yet to be measured precisely, except for marginal sectors, most employees 
expected to remain with their current employer through retirement (Walder 
1986, Ch. 2). 
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A ‘second notable feature of this pattern was the extent to which patterns of 
work and residence were combined. The production teams of rural com- 
munes, of course, completely merged the place of work and residence. Urban 
workplaces owned roughly 30 to.40% of China’s urban housing and were the 
primary source of new housing for their employees. Many large factories 
(especially on the outskirts of cities, hospitals, schools, and government 
agencies were almost self-contained communities, where large majorities of 
employees lived in factory compounds (Henderson & Cohen 1984, Whyte & 
Parish 1984, Walder 1986). 

These workplaces exhibited a striking pattern of sollective consumption. 
Funds from institutional budgets and enterprise earnings subsidized the deliv- 
ery of a broad range of services and goods to employees. New housing, 
medical facilities, schools, nurseries, retail shops, recreational facilities, and 
meal halls were commonly provided. In cities, rationed and unrationed 
consumer items, and where the employee lived in workplace housing, basic 
foodstuffs, were distributed through the workplace as well. 

Within these local communities, there was a nearly complete fusion of 
political and economic powers, and extensive political organization (Nee 
1983, Whyte 1974b). Urban workplaces of any consequence all had party and 
youth league organizations with representatives in each office and workshop, 
and propaganda, security, and personnel departments run by the party branch. 
Workplaces kept political dossiers on every employee (Walder 1986, Ch. 3). 
Every village had the same set of organizations, although its political organ- 
ization was less intensive and dossiers were kept only on such selected targets 
as former landlords (Parish & Whyte 1978). With few exceptions, Party 
members held all leadership positions. 

Officials in these communities therefore not only had a formidable political 
apparatus at their disposal, but they also had wide discretion in the distribu- 
tion of housing, consumer items, wage raises, and promotions. Several 
researchers have found that this bred a pattern of personal rule, whereby 
leaders used their discretionary powers to build loyal followings (Chan et al 
1985, Madsen 1984, Oi 1985, 1989, Walder 1986). It also provided a 
framework in which official corruption could easily grow. Moreover, the 
dependence upon administrative distribution at the workplace, along with 
official discretion in allocating goods and the material scarcity of the period, 
gave rise to an active subculture of personal ties, upon which people relied for 
the attainment of scarce goods, services, and opportunities (Gold 1985; 
Walder 1986, Ch. 5; Yang 1987). 

This pattern contradicts theories about the increasing differentiation 
and isolation of urban residents that proceeds with urbanization (Whyte & 
Parish 1984). It also.contrasts rather strikingly with earlier predictions that 
political regimes with this ability to penetrate the grass roots will atomize the 
population into a mass society (Walder 1986, Ch. 1). Instead, the pattern 
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seemed to work in the opposite direction, toward increasing community 
integration. 

While wide agreement exists about this characterization of community 
organization in the 1960s and 1970s, there is less agreement about what 
changes are accompanying the new economic policies of the 1980s. Some 
think markets and abundance are weakening dependence on the workplace 
and making it less necessary to rely on personal ties to obtain goods and 
services (Gold 1985, Vogel 1989). Yet others point to continuing lack of 
mobility (Davis 1988b) and the continuing strength of workplaces (Walder 
1986, Ch. 7, 1987), and note that markets are not antithetical to personal 
networks. ? 

The analytic question is the extent to which economic changes will trans- 
form community power, and the degree of economic change required to 
transform it. The countryside is a strategic research site for this issue, since 
collective agriculture has been dismantled and virtually all peasants are now 
deeply involved in markets. Some have found cadre resistance to these 
changes and have argued that they suffer a decline in their power and relative 
economic position (Nee 1988, Zweig 1986). Others have found that rural 
officials have been able to use their political power to become entrepreneurs 
themselves or patrons allied to the newly wealthy peasants (Oi 1989, White 
1987). While the evidence is apparently mixed, both generalizations evidently 
have some basis in fact. Future research will seek to specify those conditions 
that determine variation in the fate of community power. 

A final, currently important, question is under what conditions will markets 
erode this pattern of community organization in urban areas, or to what extent 
will this pattern of community power constrain urban markets or distort their 
workings. While it appears logical that greater labor mobility and material 
abundance will make citizens less dependent on the workplace, workplaces 
have become more active than ever in adding to their provisions of collective 
goods, and such features as workplace provision of housing and social 
security are proving to be major barriers to the development of a labor market 
(Walder 1987). Systematic research on these subjects, as change itself, is still 
in its early stages. 


BUREAUCRACY 


The antibureaucratic impulses of Maoism were of wide interest to researchers 
in the 1970s. The Cultural Revolution was merciless in its repudiation of 
bureaucrats: The state apparatus at all levels was thoroughly purged, large 
numbers were sent to perform manual labor for extended periods, the staffing 
of organizations was (initially) cut drastically, specialized skills and certifica- 
tion were denigrated in favor of political loyalty, whole classes of regulations 
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and procedures were abolished, and many agencies were disbanded or com- 
bined. Maoists also claimed to have installed a kind of participatory democ- 
racy, in which the masses were able to criticize leaders and be represented on 
revolutionary committees (Nee 1973, Whyte 1973). 

These efforts placed two questions on the agenda of the early 1970s: (a) 
Could a large administrative apparatus function effectively with a minimum 
of formal regulations and spécialized personnel? (b) Could an effective 
bureaucracy be infused with the practices of mass democracy? Typically, the 
Chinese case was perceived very broadly as one that questioned the inevitabil- 
ity of hierarchy in modern administration (Nee 1973, Walder 1979, Whyte 
1973, 1974a). 

The literature generated by these questions quickly became irrelevant, as 
research in the late 1970s made clear that the public representation of Maoist 
antibureaucratism corresponded poorly with organizational realities. The ini- 
tial staff reductions were temporary, as the purged bureaucrats eventually 
returned to their posts, but alongside a new generation of technically in- 
competent political loyalists. In reality, the bureaucracy was further bloated. 
Whole classes of regulations and procedures were abolished, but since power 
now rested with nonspecialists, delays, inaction, and evasion of responsibility 
became more widespread than before. With the abolition of regulations and 
the stress on political loyalty and demands for ideological conformity, the 
practice of mass democracy became an empty ritual, and arbitrary power 
became more fully concentrated in the hands of political officials at all levels 
of organizations (Walder 1982, Whyte 1989). 

Research in the late 1970s and early 1980s indicated that Maoism indeed 
established an alternative to rational-legal bureaucracy, but not a new one. 
Reflecting patterns of community organization, many observers began to 
describe an administration that was “patrimonial” or “neotraditional” (Kraus 
1983, Walder 1986). The destruction of established procedures and monitor- 
ing systems, the denigration of expertese in favor of loyalty in promotions, 
and the reliance on politically pure but untrained representatives to make all 
final decisions, bred a system of personal rule that easily degenerated into 
arbitrariness and corruption. By the late 1970s and throughout the 1980s, 
public commentary in China consistently declared nepotism, corruption, and 
other such “remnants of feudalism” to be the main obstacles to social and 
economic progress. 

At first glance, evolutionary theories that posit the necessity of rational 
organization for economic development might appear to be vindicated by the 
failure of the Maoist alternative and post-Mao calls for more rational bureau- 
cratic procedure. But the 1980s have shown that.corruption and personal rule 
are extremely difficult to eradicate. While it is true that more rational and 
impersonal forms of organization would better serve economic development, 
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they do not arise in an evolutionary fashion: intense political efforts at change 
have still been only partially successful. The widely reported spread of 
corruption in the 1980s is often blamed on market mechanisms and profit 
motives, but the research just reviewed indicates that a corrupt subculture was 
already firmly entrenched. This suggests that the upsurge of corruption is the 
result of new opportunities, not new orientations among bureaucrats. 

Researchers are now presented with a new agenda: To what extent will the 
continued infusion of market mechanisms cut into the scope of bureaucratic 
power; or to what extent will the pattern of personal rule divert and retard 
economic reform (e.g. Walder 1988, 1989)? 


SOCIAL STRATIFICATION AND MOBILITY 


Maoist egalitarianism naturally bred an intense interest in social stratification 
and mobility. As the first systematic data became available from surveys 
administered to emigre populations in the mid-1970s, China appeared indeed 
to be a relatively egalitarian society, but its standing relative to other countries 
depended upon which dimension of inequality was being measured, and with 
which set of nations it was compared. Later official data, surveys conducted 
by such international agencies as the World Bank, and Chinese surveys 
sometimes conducted in collaboration with foreign scholars, did not fun- 
damentally challenge these initial findings, but they did reveal the many 
respects in which certain inequalities were exacerbated by the Maoist pattern 
of development. Just as this pattern of stratification was being documented, 
however, China’s new economic model, which appeared to sanction greater 
inequalities, led to renewed speculation and debate about how these patterns 
would change. 


Income and Consumption 


In the 1970s, income distribution within Chinese cities was much more equal 
than in developing countries with market economies, and near the bottom of 
the range of income inequality for socialist bloc countries (Parish 1981, 1984; 
Riskin 1987, Ch. 10). Measures of inequality for Chinese urban incomes were 
less than half those of the low income countries of South and Southeast Asia 
(Riskin 1987, Ch. 10). Most of the variation in income in the late 1970s was 
probably due to life cycle effects (Davis-Friedmann 1985; Riskin 1987, Ch. 
10). While China’s flat urban income distribution was greatly facilitated by its 
ability to keep out the rural poor, many observers have nonetheless judged 
this to be ah impressive accomplishment (Parish 1984, Whyte & Parish 1984, 
Riskin 1987, Ch. 10). Moreover, the Chinese pattern appeared to afford fewer 
privileges to professionals and officials in housing and consumption of com- 
modities, and it showed lower overall correlations between income, commod- 
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ity consumption, and housing standards than did the other socialist states 
(Parish 1984). While the data for intravillage income distribution are less 
comprehensive, they indicate a similarly egalitarian pattern (Blecher 1976, 
Selden 1985, Riskin 1987, Ch. 10 Unger 1985, Vermeer 1982). 

However, later studies qualified these conclusions. It became increasingly 
clear that while income equality within cities and villages was quite pro- 
nounced, differences between rural regions and between city and countryside 
grew markedly, rather than shrinking, from the mid-1950s to the late 1970s. 
The urban-rural income gap grew from approximately 2:1 in the 1950s to 
anywhere from 3:1 to 6:1, depending upon whether state subsidies of urban 
housing and food consumption are included (Parish 1981, Whyte 1986). The 
gap in consumption of food grain, cooking oil, cotton cloth, and consumer 
durables widened strikingly (Lardy 1983, Riskin 1987, Ch. 10). Differences 
in incomes and consumption between rural regions also grew (Vermeer 1982, 
1988). By the late 1970s, interregional differences in caloric intake were 
greater in China than in India, Brazil, and Indonesia (Eberstadt 1988, Ch. 7). 
These interregional and rural-urban differences were considerably larger than 
those in other countries and offset levelling within communities. As a result, 
data on income distribution for the country as a whole show China’s degree of 
equality to be unremarkable. China was on a par with Bangladesh, India, Sri 
Lanka, and Yugoslavia; the poorest 40% of China’s population received the 
same percentage of national income as did India’s (Riskin, Ch. 10). While 
succeeding remarkably in keeping intracommunity wage differences very 
low, the Maoist model neglected the widening disparities between communi- 
ties. 

One major unanswered question concerns the large component of nonwage 
income in the form of collective consumption in urban workplaces. It is well 
known that the quality and availability of housing, medical care, and other 
services and goods distributed bureaucratically are likely to have varied 
significantly by the size and administrative status of the workplace. But this 
variation has yet to be measured carefully, and we do not know whether 
collective consumption serves to alleviate or exacerbate inequalities. Since 
this is one area where the greater prosperity and market distribution of the 
1980s might produce significant changes, this is an important lacuna. 

With the end of collective agriculture, and the gradual expansion of the 
private sector in urban areas, the initial presumption among many was that 
China had entered a new era of increasing income inequality (e.g. Whyte 
1981). There are already clear indications of increasing income disparities 
within villages (Riskin 1987, Ch 11). There is also evidence of increasing 
dualism in the urban labor force, and new age-related inequalities (Davis 
1988a,b). In certain respects, however, inequalities are lessening. The wage 
gap between urban and rural areas, for example, has been cut significantly as 
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rural incomes have soared in the wake of the reforms. Official data show this 
gap to have shrunk by around 25% since the late 1970s, although there is 
some speculation that increased subsidies of urban living standards may offset 
much of this (Whyte 1986, Lardy 1984). Moreover, consumption of food- 
stuffs and consumer durables, construction of new housing space, and bank 
savings have all grown faster in rural than in urban areas (Whyte 1986). 
Income differences between rural regions have been found to grow along 
some dimensions and to shrink in others (Selden 1988, Vermeer 1988). One 
survey of urban wage earners showed that the restoration of bonus payments 
and regular wage raises reduced income differences by some 25% between 
1976 and 1986, primarily because wage raises, restored after more than a 
decade of austerity, were granted in a manner that levelled incomes (Walder 
& Ruan 1988). This subject will be important in the coming years. Con- 
ceptually, it bears directly on broader questions about the roles of markets and 
bureaucracies, in different historical settings, in maintaining or alleviating. 
social inequalities. | 


Gender 


China’s record on gender inequalities in the Mao era does not appear to be 
better than other developing countries. Researchers are now fairly unanimous 
in their low estimation of the progress made. Moreover, although systematic 
research has yet to be conducted, there are preliminary indications that the 
status of women may deteriorate further as a result of economic reform. 

Surveys of emigres in the 1970s indicated that in terms of incomes and 
occupational segregation in urban areas, China’s pattern of gender inequality 
did not differ significantly from that of economically advanced nations or its 
socialist counterparts (Whyte 1984). Data released in the 1980s would show 
that the gender gaps in primary school enrollment and rates of literacy in 
China were double those of Hong Kong, Taiwan, and South Korea, higher 
even than Indonesia, and about the same as in India (Eberstadt 1988, p. 167). 
Vastly expanded participation in the labor force did not reduce gender dis- 
advantages; it only created new and more complex problems as working 
women were still widely expected to assume most of the burdens of household 
and family (Whyte & Parish 1984). 

The key problem, most analysts now suggest, was not to increase women’s 
participation in labor, but to transform the family. In rural areas, despite such 
early reforms as the marriage law which reformed the worst abuses of the old 
family system, rural socialist institutions did not challenge the patriarchal, 
patrilocal, and patrilineal character of the family. Several scholars have 
argued that these family characteristics were not challenged because they 
would have undermined peasant support for the party and made plans for 
economic development more complicated. Instead, collective economic in- 
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stitutions in the countryside were premised upon a higher estimation of the 
accounting price of a male’s “labor power,” male headship of the family, the 
bride’s residence in the husband’s household, and the inheritance of residence 
rights, the household and its effects, and political statuses through the male 
line. Rural socialist institutions, in other words, reinforced the traditional 
family system and its valuation of women (Andors 1983, Johnson 1983, 
Stacey 1983, Wolf 1985). Research on the rural family in the 1970s supports 
this depiction of rural social life (Parish 1975, Parish & Whyte 1978). 

Research in China in the 1980s has found that traditional notions of proper 
women’s roles are widespread and strongly held in rural and urban China 
among both men and women. Most striking has been the realization that 
traditional beliefs about the inherent biological, intellectual, and even moral 
inferiority of women are also popular and can even be found among university 
women (Wolf 1985, Honig & Hershatter 1988). The revival of traditional 
wedding practices since the reversion to household agriculture, and the 
apparent monopolization of income opportunities outside of villages by male 
members of the household, have led to predictions that the status of women 
will suffer further as a result of the economic reforms. One study of the urban 
labor force indicates that gender segregation is becoming more pronounced 
(Honig 1988). We await further systematic studies. 


Social Mobility 


Surveys. of emigres indicated that in the 1960s and 1970s, there was remark- 
ably little intergenerational transmission of status. Already in the early 1960s, 
the rates of status transmission from fathers to offspring were considerably 
lower than in such countries as Poland and Finland (Parish 1984). Moreover, 
each successive age cohort exhibited less variability in status, showing that 
intergenerational transmission of status was lessening through time. By the 
late 1960s, and through the early 1970s, however, such status transmission 
stopped entirely. This was due to drastic measures: In the Cultural Revolu- 
tion, schooling was curtailed after secondary school, and students were sent 
for indefinite terms to the countryside. Universities did not reopen until the 
early 1970s. These policies attained a near total equality of opportunity, but 
only because with the closing of schools and the freeze on wages and 
' promotions, the process of status attainment was simply halted for the youn- 
ger generation (Parish 1984). 

Throughout this period, and into the late 1970s, clear-cut ascriptive 
categories, politically defined, had a systematic impact on life chances. The 
entire population was categorized into statuses based on the occupation or 
political affiliations of one’s father or grandfather in 1949: The categories 
were inherited through the male line. Formerly exploited social classes, senior 
party members, and Red Army soldiers were to be favored in school admis- 
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sions, job assignments, and promotions. Former middle class professionals 
and intellectuals were in a neutral category. Former exploiting classes, and 
members of Nationalist party organizations, and relatives of political de- 
tainees suffered systematic discrimination (Kraus 1981; Whyte 1975, 1985). 

These categories had a systematic effect on educational attainment, 
although they were not entirely successful at affirmative action. One large 
survey of youthful emigres found that two groups were overrepresented in the 
better high schools: children of party officials and military officers, and 
children of educated middle-class background. The first benefitted from their 
ascriptive status; the second, from the family environment their educated 
parents were able to provide. Children of working-class and peasant parents, 
however, did not fare well in admissions to the better schools, or even in 
attaining admission to high school. The ascriptive categories acted as a cover 
for nepotism: the good labels worked in favor of the children of the powerful 
and important, who sent their children to the best schools from the outset, but 
not nearly so well for the children of the disadvantaged. The categories 
labelled “bad,” on the other hand, were very effective in blocking admission 
for the offspring of former exploiting classes and the party’s political enemies 
(Rosen 1982, Unger 1982). Similar advantages and disadvantages were found 
in career mobility during the period. | | 

Awareness of these ascriptive markers was so high among students and 
some workers that they had a major impact on patterns of friendship, and later 
the formation of political factions.in the red guard movement (Blecher & 
White 1979, Chan 1985, Chan, Rosen & Unger 1980, Lee 1978, Rosen 1982, 
Unger 1982, White 1976). Moreover, these markers were perceived as so 
central to life chances by adults that they became an important criterion in 
determining mate choice for females (Croll 1984). The systematic discrimina- 
tion against former landlords and rich peasants in rural areas was so severe 
that it was almost impossible for males of this status to marry: They had no 
chance of marrying outside their status, because women would not wish to 
subject themselves and their offspring to such discrimination; on the other 
hand, most women of bad status eagerly sought to marry a man with a better 
status (Unger 1984). 

There were also particularistic standards, again defined politically, that had 
a similar impact upon mobility chances. Those who served the party most 
actively through membership and activism in youth league and party orga- 
nizations received systematic preference in high school and college admis- 
sions, job assignments,.and career promotions (and people with favored 
ascriptive statuses received preference in joining and advancing within these 
organizations). High school students consciously adopted strategies to cope 
with these political criteria, and the early 1960s saw intense political competi- 
tion in the elite schools (Chan 1985, Shirk 1982). The pattern of youth league 
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and party membership bears this out: Rates of membership rise directly with 
years of schooling attained and are markedly higher in the better schools 
(Rosen 1982, Unger 1982). In the population at large, rates of party member- 
ship rise by educational level for post-1949 cohorts, and they rise directly by 
occupational status for all cohorts (Walder 1985). The first post-Mao analyses 
of status attainment based on Chinese surveys show that party membership is 
still a significant predictor of occupational status, though not as strong as 
studies of the 1960s seemed to indicate (Xie & Lin 1986, Blau & Ruan 1989). 

Evidently, China’s process of status attainment during the 1960s and 1970s 
was highly unusual. The sharp disjuncture between parents’ and childrens’ 
status.in the early 1970s, and the biased implementation of affirmative action 
via ascriptive statuses in the early 1960s, provide cases that should be known 
to every researcher who speculates about the limits of intervention in mobility 
patterns. Unlike research on status attainment in the United States, moreover, 
China research has been based on institutionally focused case studies. The 
long absence of survey data, and the obvious political standards, have made 
research on China more sensitive to the social and political processes that 
affect mobility. While there has been an effort recently to bring work in- 
stitutions back into the study of mobility in the United States, one still does 
not encounter the level of ethnographic detail common in China research. 

It is not clear what changes are now taking place in this mobility regime. 
The Maoist era of levelling is over, and competitive examinations and skill 
have a central place in new party policies regarding educational admissions 
and promotions.The ascriptive class labels were abolished and removed from 
individuals’ dossiers at the end of the 1970s. Political loyalty and activism, 
while not irrelevant, are no longer to be an overriding consideration. The 
opening of a small but thriving private sector in urban areas, and the reversion 
to family farming in the countryside, raise the possibility that property and 
wealth will play an increasing role in the intergenerational transmission of 
status, and political privilege less. 

A recent benchmark study (Blau & Ruan 1989) showed status attainment in 
urban China to be less constrained by father’s status and education than it is in 
the United States. There is no significant direct effect of either variable on 
sons’ status attainment; there is only an indirect effect from father’s educa- 
tion, which works through son’s education. This and other recent survey 
findings (e.g. Aie & Lin 1986) will allow us to chart more precisely than ever 
before patterns of change in China’s mobility regime. 

What will the new mobility regime look like? Some suggest that a competi- 
tive meritocracy is emerging (Shirk 1984). Others argue that the new demands 
for educational qualifications and competence will be woven into moderated 
demands for political loyalty. With rapidly rising educational levels, this 
argument goes, the party will less often have to sacrifice competence when 
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preferring the loyal and politically active (Walder 1985). With the new 
Opportunities to analyze mobility with systematic survey data, these questions 
are likely to be answered in the next decade. And in this new generation of 
research, analysts will be well served by the institutionally grounded un- 
derstanding of Chinese mobility processes that past studies have afforded. 


SUMMARY 


Social change in China during the first three decades after the revolution 
suggests an ironic observation. In one sense, the Maoist claim to have forged 
an alternative to western patterns of development and change, and the related 
academic claims that this challenged existing western theories, were entirely 
correct. But.the pattern of change that comprises this challenge is not the one 
that Maoists thought they had brought about, or the one that scholars debated 
early on. Nor were the theories challenged always the ones we originally 
thought. 

In the end, the Maoist model of development provided a striking challenge 
to those theorists of underdevelopment who had earlier applauded it. Isolation 
from the world system, and an autarkic, self-reliant strategy, led to enormous 
waste, retarded economic progress, and led to serious imbalances. Living 
standards lagged far behind many adjacent developing countries, overinvest- 
ment and waste prevailed in heavy industry, agriculture was irrationally 
organized, new inequalities widened as old ones were reduced, and the 
material position of the poorest péasants was not greatly improved. The © 
experience of 30 years—in both China and Taiwan—presents a challenge to 
those who would attribute poverty, inequality, and stagnation primarily to a 
developing country’s position in the world economic order, or who would 
counsel withdrawal from the world system as an answer. 

The same is true for the development of local Chinese communities: 
relatively self-contained, closed units in which there is a large component of 
collective consumption via direct provision of goods and services, extensive 
protection against economic insecurity, a fusion of political and economic 
powers, authority via personal loyalties, and a prevalent subculture of per- 
sonal alliances and mutual help. These developments represent decisive 
changes from a modern Chinese urban past marked by extensive job and 
geographic mobility, short job tenure, and extreme social insecurity. They 
contradict prevalent theories of urbanism, mass society (whether of the 
totalitarian or pluralist variety), and evolutionary theories of social differenti- 
ation. And they should stimulate a rethinking of the economic and political 
structures that facilitate actual empirical patterns of social differentiation and 
de-differentiation. 
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Where researchers originally thought that the issues raised by China’s 
bureaucratic experience had to do with the reduction of hierarchy and the 
infusion of democracy, the real trend turned out to be the fusing of patrimo- 
nial with modern bureaucratic forms. This development suggests a sequel to 
Michels’ iron law of oligarchy: perhaps the oligarchy that Michels saw in the 
socialist movement was only a precursor to a subsequent stage, in which an 
“iron law of paternalism” afflicts socialist parties that head centrally planned 
economies. These developments, and the realization that current efforts to 
transform the Chinese bureaucracy according to a more rational-legal model 
requires a tough political fight against entrenched interests, suggest a realm of 
problems to which the sociology of organizations has devoted little attention. 
They lead back to the original concerns of Weber and Michels. 

China’s pattern of stratification and mobility, which for 30 years was 
pervaded by politically defined ascriptive and particularistic standards, also 
provides a challenge to comparative research. There are a number of impor- 
tant variables in the Chinese case whose impact we have only recently been 
able to measure: collective consumption at the workplace (especially urban), 
political ascription, and political particularism. In this stratification system, 
income measures alone may not correlate closely with actual styles of life, 
and knowledge of institutional processes and individual behavior is necessary 
properly to conceptualize and measure the political dimension of mobility. 
Comparative studies of stratification may not automatically assume that pres- 
tige scores and income will fully capture social realities. And comparative 
mobility studies, which are usually preoccupied with comparing rates of 
mobility, will have to confront the political and institutional processes that 
may prove to be the most distinctive differences between nations. 

Finally, the economic changes of the 1980s present a number of challenges. 
To what extent can market forces and a private sector be combined suc- 
cessfully with central planning in the long run? What will be the impact upon 
China’s rural poor? Will these market mechanisms serve to widen income 
disparities and other inequalities, or will they serve to alleviate the inequali- 
ties inherent in China’s earlier bureaucratic system? To what extent will the- 
self-contained local community be broken down by the growing alternatives 
outside the collective structures? To what extent will these economic changes 
transform the bureaucracy in a rational-legal direction, and to what extent 
will the patrimonial nature of the bureaucracy block further change and di- 
vert new economic opportunities into official corruption? To what extent will 
these changes undercut political standards of mobility and make China’s 
mobility processes look more like those elsewhere? These questions will 
preoccupy researchers in the 1990s, and the answers should be of interest to 
us all. 
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COVARIANCE STRUCTURE 
MODELS' 
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National Institute of Mental Health, Bethesda, Maryland 20892 


THE COVARIANCE STRUCTURE MODEL? 


Over the last 16 years, since Karl Jéreskog first introduced the LISREL 
model', the covariance structure model has gained considerable popularity 
among serious researchers in many research and applied fields. This has 
happened for two reasons. First, covariance structure models depict in a 
natural way the “causal” models researchers have in mind when thinking 
about the processes they are looking at. And equally important, the computer 
software has simplified the estimation of the parameters of covariance struc- 
ture models, making it possible to estimate large, sometimes intricate, models 
using an uncomplicated syntax. 

With other methods of analysis, the ‘method itself often determines the 
features of the model, and these are thus nonintuitive to the researcher. A 
loglinear model for 10 or 15 observed variables can produce a staggering 
number of parameters, many without clear theoretical implications. A covari- 
ance structure model, on the other hand, can be represented by intuitive path 
diagrams that do not require math courses to understand. And the estimated 
coefficients generated by the computer software are directly analogous to 
regression coefficients in multiple regression (which in fact may be regarded 
as a special case of the general covariance structure model) and have identical 
substantive interpretations. 

Covariance structure models are particularly ‘sei in survey research 
where the number of variables can be large and the researcher envisions many 
“causal” connections among them. But the large survey also presents another 
theoretical problem that covariance. structure models can handle with ease: 


‘The US government has the right to retain a nonexclusive, royalty-free license in and to any 
copyright covering this. paper. 

?The name LISREL is a registered trademark owned by Scientific Software of Mooresville, 
IN, and the use of LISREL as a synonym for covariance structure models is to be discouraged— 
not least because this amounts to an advertisement for a particular brand of computer software. 
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multiple measures of constructs. Often the researcher is faced with either 
choosing one measure from among a set of variables or attempting to put them 
all into the model at once. Either way the researcher loses-—-any one measure 
is certain to be an imperfect version of the set of measures; but if they are all 
entered at once, serious multicollinearity can be introduced into the model. 
The covariance structure model provides a solution to this dilemma directly 
by allowing the researcher to incorporate the multiple measures as “in- 
dicators” of “latent” variables or constructs. 

The covariance structure model is attractive, then, for. its unified treatment 
of what were once several complex problems for the researcher. A common 
approach to the analysis of survey data involved first a reduction of the many 
measures found in the data set through some kind of factor analysis; next, a 
construction of an index or composite. measure; and then numerous multiple 
regressions. Covariance structure analysis combines all of these steps into one 
analysis. The gain is substantial. Maximum likelihood estimation is guaran- 
teed efficient, provided the model is correctly specified; and by doing it all in 
one step, correct statistical inference is possible. (In the above-mentioned 
traditional method, researchers have ignored the sampling variability in- 
troduced by estimating indexes, a fact that seriously degrades’ substantive 
conclusions in the traditional method.) 

In the last few years general discussions of applications of this model in 
substantive contexts have appeared in journals. See, for example, Godwin 
(1988) in marriage and the family, Rindskopf (1984) and Fleischman & 
Benson (1987) in education, and Huba & Harlow (1987) in child develop- 
ment. Textbooks devoted to covariance structure models have also begun 
appearing recently, and they are discussed in more detail in a later section of 
this chapter. 


Covariance Structure Modelling in Social Research 


Kohn & Schooler in their work on job conditions and psychological function- 
ing have exploited two powerful features of the covariance structure model: 
longitudinal as well as reciprocal relationships among variables. When data 
are available at two (or more) times on two variables, the researcher may be in 
a position to explicate the causal relationship between them as well as to 
explore their reliability across time. With additional exogenous variables a 
cross-lagged effect—i.e. a causal effect across..time of each of the two 
variables on each other as well as the contemporaneous effect—-may be 
identified also. Thus, Kohn & Schooler (1978) show that the effects of the 
“substantive complexity” of a job on a person’s “ideational flexibility” are 
significant and essentially contemporaneous, whereas the reverse effects are 
also large but longitudinal and not contemporaneous. In addition both vari- 
ables exhibited substantial reliability across time. 

Kohn & Schooler and their colleagues have made extensive use of the 
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covariance structure model, and the reader is referred to their work for further 
applications of this model to the study of work and occupations (Kohn et al 
1983, Kohn & Schooler 1973, 1978, 1982; Miller 1980; Schooler et al 1984), 
to cross-national research (Kohn et al 1986; Naoi & Schooler 1985; Slom- 
czynski et al 1981; Miller et al 1981, 1985; Miller 1984), and to education 
(Miller 1986; Miller et al 1985). Other applications in the work and occupa- 
tion area are Mortimer & Lorence (1979a,b), Lorence & Mortimer (1985), 
Mortimer et al (1986), Bielby & Bielby (1984), Fenwick & Olson (1986), and 
James & Tetrick (1986). 

In social systems an opportunity is presented for imaginative modeling that 
has been exploited by a number of researchers. An early example ts a model 
in Duncan et al (1968) which presumed that background influences on the 
educational and occupational aspirations of adolescents operated on the friend 
of the subject indirectly through the subject. This rules out a direct causal 
effect of these background variables on the friend identifying reciprocal 
effects between subject and friend, thus neatly providing a description of the 
effects of social ties. Another example of imaginative modeling is the work of 
Kandel et al (1988) who analyze homophily in dyads—subjects paired with 
best friends in adolescence and with spouses in later life; they use measures of 
each person in the dyad as indicators of.a latent variable, to evaluate the 
process of homophily across time. Hauser & Mossel (1987) disaggregate 
between and with family variance in the effect of status on educational 
attainment; they exploit information on at least two members of each family in 
a large sample. 

Many other papers have appeared in recent years that illustrate the basic 
features of the covariance structure model. Studies of youth and adolescence 
seem to form a very active area for the application of covariance structure 
models (Byrne & Shavelson 1987, Alwin & Thornton 1984, Lau & Klepper 
1988, Lindsay & Knox 1984). Simmons & Blyth (1987) explore the develop- 
ment of self-esteem in adolescents. In delinquency research we find Matsueda 
(1982), Matsueda & Heimer (1987), McCarthy & Hoge (1984), Kaplan et al 
(1986), Osgood et al (1988), and Piliavin et al (1986). 

Other papers appearing in the sociological literature using covariance 
structure models are Gottfredson (1981) who looks at race and educational 
attainment; Judd et al (1981) who are studying political involvement; Link 
(1982), a study of the effects of a mental illness diagnosis on status and 
income; Massagli & Hauser (1983), who are analyzing survey reports of 
indicators of socioeconomic status; Parker (1983) and Thornberry & Christen- 
son (1984) who look at the relationship of criminal behavior and un- 
employment; and Thornton et al (1983), who are analyzing sex-role attitudes. 

The most activity in the use and development of covariance structure 
models has occurred in psychology and is part of a long tradition in factor 
analysis (Bentler 1986, Mulaik 1986). The classic works in covariance struc- 
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ture models are in that tradition: Bock & Bargmann (1966), Lawley & 
Maxwell (1971), Jéreskog (1969, 1970). And that tradition continues with 
important recent work by McDonald (1985) and ‘Bartholomew (1987). 

These roots in factor analysis notwithstanding, psychologists and psy- 
chometricians are increasingly devoting their attention to the “confirmatory” 
style of modeling—a type of model in which the observed variables generally 
indicate only a single latent variable, as opposed to the “exploratory” type of 
model in which few claims are made in advance about which observed 
variable loads on which latent variable or factor. Foremost among the psy- 
chometricians advocating the confirmatory type of model are Peter Bentler 
and his colleagues, who have contributed substantially to the development of 
the method (Bentler 1980, 1982; Bentler & Weeks 1982; Huba & Harlow 
1987) and have also employed it in substantive investigations (Bentler & 
Peeler 1979; Bentler & Speckart 1981, 1984; Tanaka & Bentler 1983; Tanaka 
& Huba 1987; Tanaka et al 1988). 

In economics a developed theory of latent variable models exists (Aigner & 
Goldberger 1977, Aigner et al 1984), and a well-defined concern for measure- 
ment error (“errors in variables”) in structural equation models as well (Judge 
et al 1980). However, econometricians have not accepted the notion of 
“multiple indicators” of latent variables, and thus few applications of covari- 
ance structure models have appeared. Instead the structural coefficients in the 
model are identified either by using the redundant measures as instrumental 
variables in instrumental variable estimation, or by using: overidentifying 
restrictions generated by leaving exogenous variables out of equations in 
maximum likelihood estimation. 

A special issue of Child Development was devoted to covariance structure 
models and included leading psychometricians working on covariance struc- 
ture models (Gollob & Reichardt 1987; Hertzog & Nesselroade 1987; Huba & 
Harlow 1987; McArdle & Epstein 1987; Mulaik 1987; Tanaka 1987). In 
marketing research the use of covariance structure models has been encour- 
aged by the work of Professor Bagozzi (Bagozzi 1980, 1983) and his col- 
leagues (Fornell 1983; Fornell & Larcker 1981). Researchers in that field 
are beginning to respond with applications (Bagozzi &;Burnkrant 1985, 
Bagozzi & Van Loo. 1988, Achrol & Stern 1988, Churchill & Pecotich 
1982). 

In educational research as well, the development and explication of covari- 
ance structure models is being carried on (Anderson & Evans 1974; Anderson 
1978; Kroonenberg & Lewis 1982; Wolfle 1982; Wolfle & Etherington 1986; 
Fleishmen & Benson 1987; Rindskopf 1984; Cuttance & Ecob 1987; Marsh & 
Butler 1984). As a result applications are appearing in the educational litera- 
ture (Wolfle & Robertshaw 1983; Ethington & Wolfle 1984, 1986; Marsh 
1987; Page & Keith 1981; Pascarella et al 1986; Kuusinen & Leskinen 1988). 
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Texts on Covariance Structure Modelling 


At the time of Bielby & Hauser (1977) very little instructional material was 
available for the researcher interested in covariance structure models. To 
apply these models to data required that the researcher study the source 
literature, which was intimidating for those not sophisticated in statistical 
methods. Now, however, major textbooks on multivariate methods routinely 
include chapters on covariance structure models (Asher 1983; Bagozzi 1980; 
Fox 1984; Heise 1975; Kenny 1979; Lunneborg & Abbott 1983), and special 
volumes devoted entirely to these models have appeared (Long 1983a,b; 
Everitt 1984; Hayduk 1987; Loehlin 1987; Bollen 1989; Jéreskog & Sérbom 
1988; Kim & Mueller 1978; McDonald 1985; Saris & Stronkhorst 1984). The 
Long (1983a,b) volumes in the Sage series are currently the most widely used 
texts, though strictly speaking these two pamphlet-sized books are not large 
enough nor comprehensive enough to qualify as texts. Two textbooks, 
Hayduk (1987) and Loehlin (1987) have just appeared. The former contains a 
simplified description of covariance structure models and is designed for use 
primarily with the LISREL software. The latter text is aimed at psychologists 
and covers exploratory factor analysis, rotation, and topics of less interest to 
sociologists. 

Karl Jéreskog, who designed LISREL, the original software for the estima- 
tion of the parameters of covariance structure models, has released LISREL 
VII, the latest version of that software and has published a comprehensive 
textbook to go with it (Jöreskog & Sdrbom 1988). This book is more 
advanced than Hayduk (1987) or Loehlin (1987) and will be more useful to 
serious researchers. Bollen (1989) also covers advanced as well as elementary 
topics; a preliminary inspection of the manuscript (it has yet to be published) 
reveals a thorough and quite readable treatment of covariance structure mod- 
els. 

Hayduk (1987) will be popular with the unsophisticated LISREL user, but 
serious users will ultimately find this text inadequate. If one is committed to 
the LISREL software, then Jéreskog & Sérbom (1988) will provide the 
necessary background in covariance structure models. A serious user, howev- 
er, will not want to be committed to a particular software package and will 
certainly want exposure to as much of the most relevant statistical theory as 
possible. Only Bollen (1988) provides this level of material and therefore is 
the prime candidate for the standard text on the topic of covariance structure 
models. 


Software for Covariance Structure Modelling 


The standard computer software for estimating parameters of covariance 
structure models has been the various: versions of LISREL (Jéreskog & 
Sérbom 1978, 1981, 1984). The major statistical software packages have 
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avoided writing their own software and have instead purchased extant soft- 
ware. SPSS (SPSS, Inc., 444 N. Michigan Avenue, Chicago, IL 60611, 
312-329-3500) has purchased LISREL VII (Jéreskog & Sörbom 1988). 
BMDP (BMDP Statistical Software, Inc., 1440 Sepulveda Blvd Suite 316, 
Los Angeles, CA 90025, 213-479-7799) has purchased EQS (Bentler 1985). 
SAS (SAS Institute Inc., Box 8000, Cary, NC 27511, 919-467-8000) pur- 
chased COSAN (Fraser 1980) and asked Wolfgang Hartman to develop a 
revised and extended version to be called CALIS, which will be available 
sometime during 1989. Scientific Software, Inc. (1369 Neitzel Road, 
Mooresville, IN 46158, 317-831-6296), a company which distributes soft- 
ware products but does not have a comprehensive data analysis package, will 
be distributing LISREL VII, the latest version. They also are distributing 
LISCOMP (Muthén 1987), a covariance structure analysis program designed 
by Muthén specifically for categorical data (including dichotomous). 

Two other covariance structure programs for mainframe computers are 
available without charge. One of these may be obtained from Jack McArdle in 
the Department of Psychology at the University of Virginia in Charlottesville, 
Virginia. His program uses a model that he developed with McDonald 
(McArdle & McDonald 1984) called the RAM model. Another mainframe 
“freeware” program is available from the author of this chapter. This program 
is called MILS and is a much revised version of LISREL II. The disadvantage 
of any freeware program such as this is that it may not be generally as well 
documented as the commercial programs. The advantage is that you will be 
getting source code that can be modified. SPSS-LISREL, EQS, CALIS, 
LISREL VII, and LISCOMP all have microcomputer versions of their main- 
frame programs. Another covariance structure program called LINCS 
(Schoenberg 1988) has been written in the GAUSS programming language 
(Aptech Systems, Inc., 26250—196th Place SE, Kent, WA 98042, 206-631- 
6679) and is available only for microcomputers. 


CURRENT ISSUES 
Covariance Structure Modelling Challenged 


The development and growth in the use of covariance structure models 
described above has not been without some resistance. In two separate 
debates, one in an edited text, (Freedman 1985) and another in a special issue 
of the Journal of Educational Statistics (Freedman 1987), Professor Freed- 
man has forcefully urged the abandonment of the use of covariance structure 
models entirely. Moreover, he is only one of several insistent-and articulate 
critics of the application of these models to social science investigations (see 
Cliff.1983, 1987; Rogosa 1987; Bielby 1986; and Clogg 1984, for example). 
Professor Duncan is an important critic who had been one of the earliest 
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proponents of structural equation and covariance structure models (Duncan 
1975), but who more recently has been arguing instead for the application of 
log-linear and Rasch models (i.e. latent trait models) to social science data 
(Duncan 1984). 

Very briefly, the critics argue that the data to which the covariance struc- 
ture models are applied generally fail to conform to the required assumptions. 
The data used in social science research are most commonly comprised of 
categorical variables rather than the “metric” (i.e. continuous over the real 
line) variables for which the covariance structure model was designed. 
Moreover, there is rarely any theoretical justification for the assumption that 
the data possess a multivariate normal distribution, a critical assumption that 
permits the use of the maximum likelihood method. A third complaint is that 
relationships are almost always assumed to be linear. 

These are serious objections to the use of covariance structure models and 
they are insurmountable obstacles for many serious researchers. Duncan 
(1984) and Clogg (1984) have argued that investigators should turn to the 
log-linear model and to the latent class model (Haberman 1979; Goodman 
1974; Clogg & Goodman 1985) for the analysis of categorical social data. The 
latent trait model is another alternative for the analysis of dichotomous and 
categorical items. (Lord & Novick 1968, Andersen 1980, Bock & Aitkin 
1981). 


The Covariance Structure Model for Categorical Data 


Log-Linear and latent class models, which explicitly take into account the fact 
that most social science variables are categorical, are valuable additions to 
statistical methods and provide descriptions of social processes that cannot be 
obtained in other ways. There are some limitations, however, to the applica- 
tion of these models to social science data. Nonexperimental research general- 
ly entails a large number of uncontrolled variables, and as a result the 
researcher is usually compelled to specify complex models in order to avoid 
accusations of misspecification more serious than the maldistribution of the 
variables. Also, when the models are large and complex the special 
methods—latent class, latent trait, log-linear models—become numerically 
and mathematically intractable. The number of parameters to be estimated 
proliferate, and their interpretation becomes intricate and nonintuitive. 

The computer programs for these models, even in this time of nearly 
unlimited mainframe resources, can require vast amounts of either memory or 
time. And the numerical optimization methods become exceedingly sensitive 
to the condition of the data: if the data are sparse, multicollinear, or derived 
from complex sample designs—all of which characterize many data sets 
available to social science researchers—the optimization will deteriorate and 
the estimation may fail. Moreover, the likelihood surfaces that some of the 
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models (and this applies in particular to the latent class model) are pitted with 
local minima which trap the optimization in false solutions about which the 
user must be wary (see Haberman 1988 for a.possible solution to the numer- 
ical problems of latent class models). 

Covariance structure models are subject to some of these same problems. 
However, the degree to which they are affected is substantially less than are 
log-linear models. Researchers are confronted with a dilemma: they may 
choose a covariance structure model, which has an intuitively appealing form 
as a “causal” structure and has a manageable set of easily interpretable 
parameters, but which leaves the researcher who is analyzing a categorical or 
non-normally distributed data set open to criticism. Or they may choose a 
latent class model that makes fewer demands in terms of distributional 
assumptions, but severely limits the complexity of the model, thereby leaving 
the researcher open to further charges of misspecification. 

An acute awareness of these problems has prompted a number of people 
specializing in the development of covariance structure models to generalize 
the covariance structure model and enhance its features so that the basic 
model is retained while it becomes applicable to categorical and non-normal 
variables. A promising approach was developed by Browne (1982, 1984). 
First, Browne (1984) has shown that when the data are kurtotic, that is, their 
distributions are either peaked or flattened more than the normal distribution, 
the maximum likelihood estimates based on the normal distribution will be 
consistent, but the statistical inference will compromised. He then showed 
that a simple correction may be applied to the standard errors of the parame- 
ters and the chi-squared statistic for the goodness of the fit of the model using 
a multivariate coefficient of kurtosis (Mardia 1970; Mardia & Foster 1983; 
see also Satorra & Bentler 1989). Then he developed a weighted moment 
estimation method that incorporated information from higher order moments 
of the distributions of the data. This method has been implemented in the 
EQS, LISREL VII and LINCS computer programs. 

The value of Browne (1984) was realized with further work (Olsson 1979; 
Olsson et al 1982; Muthén 1984) on the computation of polychoric correlation 
coefficients (for categorical variables) and polyserial correlation coefficients 
(for mixtures of categorical and continuous variables). In 1981 Jéreskog and 
Sérbom introduced a version of their covariance structure analysis software, 
LISREL V, that could compute polychoric and polyserial correlation coeffi- 
cients (Jéreskog & Sérbom 1981). With this method it was possible to 
estimate parameters of covariance structure models with observed data that 
were categorical or mixtures of categorical and continuous. 

Each observed categorical variable was assumed to be the outcome of a 
latent variable with unit normal distribution. Olsson (1979) refined techniques 
(the first discussion of their computation may be found in Pearson 1901) for 
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computing the correlation of two such latent variables with each other 
(polychoric) and with continuous variables (polyserial). LISREL V’s applica- 
tion of this method, however, would often generate correlation matrixes that 
were negative definite, thus precluding analysis. Furthermore, a method for 
computing the correct standard errors of the parameters and the correct 
chi-squared statistic of goodness of fit was not available in LISREL V. 

Getting correct standard errors was solved by Muthén (1984) who, building 
upon work by Olsson (1979), showed that the variance-covariance matrix of 
the estimates of the polychoric and polyserial correlation coefficients could be 
used as weights in a generalized moment estimation procedure (Browne 
1984). The computation of these coefficients was further refined (Olsson et al 
1982), substantially improving the algorithms for computing the polychoric 
and polyserial coefficients. These new methods have been implemented in 
two new programs: LISCOMP (Muthén 1987), a computer program for the 
estimation of the parameters of covariance structure models for data that 
contain mixed categorical and continuous variables, and LISREL VII (Jores- 
kog & Sérbom 1988). 

The .Browne/Muthén procedure just described does not solve all the 
specification problems with respect to the analysis of categorical variables. It 
does point toward solutions to data analysis problems that will retain both the 
intuitive appeal and ease of interpretation of the covariance structure model 
and sufficiently correct specification to yield consistent and efficient estimates 
of the structural parameters. _ 


Misspecified Covariance Structure Models 


At the root of Freedman’s (1985, 1987) criticism of covariance structure 
models lies his claim that their application to real data is inherently ad hoc. 
That is, in no case is the investigator able to generate a fundamental theory 
that justifies all of the required assumptions of the model. And there doesn’t 
seem to be any likely future prospect of supporting the claim of a multinormal 
distribution that is required for the maximum likelihood procedure. Therefore 
covariance structure analysis will always remain an ad hoc approximation. 
First, this argument overlooks the fact that for many researchers an ad hoc 
approximation will do quite well. Many areas of research do not have 
well-developed theories, and often the investigators are engaged primarily in 
applied work. The covariance structure model provides a means for producing 
intermediate results that might be used either for practical applications.(what 
would be the effect of this or that manipulable variable on educational 
achievement, drug use, consumer behavior, etc), or as exploratory results for 
building theory (see Fienberg 1985 for a response to Freedman 1985). 
Second, the exploration and confirmation of underlying “causal” structure 
may not require the specification of the entire process. Parameter estimates of 
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covariance structure models applied to dichotomous data are consistent if not 
efficient. Consistent estimates may be produced under a wide variety of 
misspecification of disturbance distributions including heteroskedasticity due, 
for example, to complex sample designs. If the data set is large, consistency 
may be quite enough for the confirmation and development of theory. 

The key issue in structural modeling, in this point of view, is not es- 
timation—since asymptotic consistency will withstand a lot of mis- 
specification—but rather statistical inference and the testing of models. One 
approach to these problems has been outlined by Satorra & Bentler (1989) 
who describe procedures for generating robust estimates of standard errors 
and fit statistics which will be implemented in ‘EQS. ; 

Arminger & Schoenberg (1989) have developed methods for misspecified 
covariance structure models using results from Gourieroux et al (1984) and 
White (1982). They show that maximum likelihood estimation (called 
“pseudo-maximum likelihood” estimation) is meaningful when the theoretical 
distributions are misspecified. They then describe a method, similar to, but 
simpler than, Satorra & Bentler (1989), for computing correct standard errors 
from White’s “heteroskedastic-consistent” covariance matrix of the parame- 
ters (White 1980). 

They also develop a “Hausman-type” statistic (Hausman 1978; White 
1980) which tests for the consistency of the parameters. Thus, one tests for 
the meaningfulness of the parameters directly rather than for “goodness of 
fit,” which may not be a sensible test when the model is known to mis- 
specified. The Hausman-type test and the heteroskedastic-consistent covari- 
ance matrix of the parameters are calculated in the LINCS computer program 
(Schoenberg 1988). 


THE FUTURE 
Limited Observed Variables 


Work will certainly continue on covariance structure models for categorical 
data. Muthén’s computer program LISCOMP (Muthén 1987) already handles 
censored data, and we may expect that methods will appear for handling a 
greater variety of limited observed variables. The sample selection methods 
developed by the econometricians (Maddala 1983) may be extended to covari- 
ance structure models. 


Nonlinear Models 


Another critical area in covariance structure models ts the nonlinear model. 
Kenny & Judd (1984) describe a method for computing estimates of parame- 
ters of models that contain “interaction” or product terms involving latent 
variables. This method requires a computer program that permits nonlinear 
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constraints on parameters (COSAN and LINCS have this feature) and also 
requires the multinormality of both the observed and latent variables. This 
technique is not robust to the misspecification of the distributional assump- 
tion. 

Arminger & Sobel (1988) present a general nonlinear model that permits 
the specification of nonlinear relationships of any form between latent and 
observed variables as well as among latent variables. as method has not yet 
been implemented in computer software. 


Missing Observations 


Muthén et al (1987), Arminger & Sobel (1988), and Allison (1987) present 
methods for estimating covariance structure models using data that contain 
missing observations. These methods are all based on the multiple group 
approach (Jéreskog 1971) which is an option available in LISREL and EQS. 
The program LINCS applies the method described in Arminger & Sobel 
(1988) which essentially treats each case or observation as a separate “group”. 
Arminger & Sobel (1988) show how this method may also be used to analyze 
data from different data sources as well as to handle some types of omitted 
variable problems. 


IN CONCLUSION 


The covariance structure model will clearly be one of the important tools for 
the analysis of social data. Researchers dissatisfied by the limitations of other 
types of models in describing social processes will continue to turn to the 
covariance structure model for its ability to represent complex in- 
terrelationships wai a parsimonious set of intuitively interpretable coeffi- 
cients. 

Meanwhile progress continues in the treatment of categorical data in the 
covariance structure model. Satisfactory solutions presently exist for smaller 
models, and we may expect that these solutions will be extended to larger 
models in the near future. In the meantime evidence is appearing for a general 
convergence of all of the methods in a single omnibus model. Takane & de 
Leeuw (1987) show that item response theory (i.e. latent trait models) and the 
factor analysis of dichotomous variables (Muthén 1984) are equivalent. Tjur 
(1982) has shown that the latent trait model is also a log-linear model under 
certain circumstances thus connecting latent class models, a special case of 
log-linear models, to latent trait models. All of the methods for analyzing 
categorical data appear then to be simply reparameterizations of each other. 
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Vols. 31-32 (1986-1987)... eee ee ee es $31.00/$34.00 
Vol. 33 (O88) secs era Gata das $34.00/$38.00 
Vol. 34 (avail. Jan. 1989)........2000. $34.00/$38.00  Vol(s) Vol 
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Annual Review of FLUID MECHANICS 


Vols. 1-4, 7-17 (1969-1972, 1975-1985)....... $28.00/$31.00 
Vols. 18-19 (1986-1987)... 2... cece wc eee $32.00/835.00 
Vol. 20 GSS) a anra e a aaa $34.00/$38.00 
Vol. 21 (avail. Jan. 1989)..........06- $34.00/$38.00  Vol(s). ————— Val. 
Annual Review of GENETICS 
Vols. 1-19 (1967-1985)..........-..00-- $27.00/$30.00 
Vols. 20-21 (1986-1987). 2... cece ce ccees $31.00/$34.00 
VoL 22 (1988) si Wet eb aeiewere te tved $34.00/$38.00 
Vol. 23 (avail. Dec. 1989) ..........000. $34.00/$38.00 Vol(s)._. Vol 
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Annual Review of IMMUNOLOGY 


Vols. 1-3 (1983-1985)... .. cc cee cee eeee $27.00/$30.00 
Vols. 4-5 (1986-1987)... ... ccc nee eeeee $31.00/$34.00 
Vol. 6 (1988) s ccwiteehesewteadewiws $34.00/$38.00 
Vol. 7 (avail. April 1989) ............ $34.00/$38.00  Vol(s). ——— Vol. 
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Annual Review of MATERIALS SCIENCE 


Vols. 1, 3-12 (1971, 1973-1982).........-.-- $27.00/$30.00 
Vols. 13-17 (1983-1987)......0.c cece eee $64.00/$67.00 
Vol. 18 (1988) 2seccaseurewesesaatece $66.00/$70.00 
Vol. 19 (avail. Aug. 1989)............- $66.00/$70.00 Volls). ——— Val 
Annual Review of MEDICINE 
Vols. 9, 11-15 (1958, 1960-1964) 
17-36 (1966-1985)... .. cc eee ee eae $27.00/$30.00 
Vols. 37-38 (1986-1987)....... PENEI $31.00/$34.00 
Vol. 39 (1988) cc awe vara secretacws $34.00/$38.00 


Vol. 40 (avail. April 1989) ............ $34.00/$38.00  Vol(s). —— Val. 
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$ Prices Postpaid 
ANNUAL REVIEWS SERIES per volume Regular Order Standing Order 
USA & Canada/elsewhere Please send: Begin with: 

Annual Review of PSYCHOLOGY Vol number: «= Vernon 

Vols. 4, 5, 8, 10, 

13-24, (1953, 1954, 1957, 1959, 1962-1973) 
26-30, 33-36 (1975-1979, 1982-1985)....... $27.00/$30.00 

Vols. 37-38 (1986-1987)... .. 6. cece ee eee $31.00/$34.00 

Vol. 39 CIQBRY «cic aE ENER $34.00/$38.00 

Vol. 40 (avail. Feb. 1989).........0005 $34.00/$38,00  Vol{s}. L — Vol 
Annual Review of PUBLIC HEALTH 

Vols. 1-6 (1980-1985). 0... 0. cece eee ees $27.00/$30.00 

Vols. 7-8 (1986-1987)... cece eee e eee $31.00/$34,00 

Vol. 9 (W988): J ietsaceetcdaawevsacs $38.00/$43.00 

Vol. 10 (avail. May 1989)..........06. $39.00/$43.00  Vol{S). — Vol. 
Annual Review of SOCIOLOGY 

Vols, 1-11 (1975-1985)... cc cee eee eee eee $27.00/$380.00 

Vols. 12-13 (1986-1987)... 0. cece eee eee $31.00/$34.00 

Vol. 14 (1988) ssn velievanisaeevenees $39.00/$43.00 

Vol. 15 (avail. Aug. 1989)..........66. $39.00/$43.00  Vol{s). Vol 


Se ed 








SPECIAL PUBLICATIONS Tacen FORpRIg Kogula Orre 
per volume Please Send: 
USA & Canada/elsewhere 
The Excitement and Fascination of Science 
Volume 1 (published 1965) Clothbound ........... $15.00/$19.00 Copyties). 
Volume 2 (published 1978) Hardcover ...........5- $25.00/$29.00 Copyties}. 
Softcover ........... ... $15.00/$19.00 Copyties}. 
Volume 3 Hardcover .......-.6 To be announced Copyties). 
Intelligence and Affectivity: 
Their Relationship During Child Development, by Jean Piaget 
{published 1981) Hardcover ........sespeserernorenerronreer> $8.00/$9.00 Copyltes). 
TO: ANNUAL REVIEWS INC., a nonprofit scientific publisher 
4139 El Camino Way 
P.O. Bex 10139 
Palo Alto, CA 94303-0897 USA 


Please enter my order for the publications checked above, California orders, add sales tax. 
Prices subject to change without notice. 


Institutional purchase order No. 


Amount of remittance enclosed $ 


INDIVIDUALS: Prepayment required in U.S. funds or 
charge to bank card below. Include card number, 
expiration date, and signature. 


Charge my account O VISA Acct. No. 
O MasterCard O American Express Exp. Date 
Signature 

Name 

Please print 
Address 

Please print 

Zip Code Date 
Send free copy of current Prospectus O 
Area(s) of Interest $ 


ARI Federal LD. #94-1156476 


rP 


Prices Postpaid 


ANNUAL REVIEWS SERIES per volume Regular Order Standing Order 


USA & Canada/elsewhere Please send: Begin with: 
Vol. number Vol number 


* 


Annual Review of MICROBIOLOGY 


Vols. 18-39 (1964-1985}...... seeecescen o $27.00/530.00 
Vols, 40-41 (1986-1987}...... sereen o ea e+ $31.00/$34.00 
Vol. 42 (OBB) sk sles raa . $34.00/$38.00 
Vol. 43 (avail. Oct. 1989) ......... . +». $34.00/$38.00 Vol{s),.-.- Vol. 
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Annual Review of NEUROSCIENCE 


Vols, 1-8 (1978-1985)... ee. cceeee esac: $2'7.00/$30.00 
Vols. 9-10 (1986-1987)... eee ew eee $31.00/$34.00 
Vol. 11 (1988) .nccccccannon seer oo o + 9934.00/838.00 
Val. 12 {avail. March 1989)..........- $34.00/$38.00 Vol{s).__..... Vol 
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Annual Review of NUCLEAR AND PARTICLE SCIENCE 


Vols, 12-35 {1962-1985}... cc cece eee $30.00/$33.00 
Vols. 36-37 {1986-1987}... cece eee eees $34.00/$37.00 
Vol. 38 (LOSS) operasi r uiis adden $49.00/$53.00 
Vol. 39 (avail. Dec. 1989) ..... PEE $49.00/$53.00 Vol(s)._._—___...._ Vol. 
Annual Review of NUTRITION 
Vols. 1-5 (1981-1985). i rssi $27.00/$30.00 
Vols. 6-7 (1986-1987)... .cccwcccnvecs $31.00/$34.00 
Vol. 8 (1988) aunan ns euR ATENEA $37.00/$41.00 
Vol. 9 (avail. July 1989)........000 0s $37.00/$41.00 Volfs) Vol 


FeO enna eee eee ee ee T nee errr irtititisiiiiiiiiiitie tir 


Annual Review of PHARMACOLOGY AND TOXICOLOGY 


Vols, 2-3, 5-25 {1962-1963, 1965-1985) .......$27.00/$30.00 . 

Vols. 26-27 (1986-1987)... cc cece ee eee $31.00/$34.00 

Vol. 28 {1988} .........05000. esse + $34.00/$38.00 

Vol. 29 {avail. April 1989) .........06. $34.00/$38.00 Vol(s})._...... VoL 
Annual Review of PHYSICAL CHEMISTRY 

Vols. 11-21, 23-36 (1960-1970, 1972-1985).......$28.00/$31.00 

Vols. 37-38 (1986-1987)... cece eee e ene ee $32,00/$35.00 

Vol, 39 (1988) 4 .vsewieneeaiwes .. +. $38.00/$42.00 

Vol. 40 (avail. Nov. 1989)..... atari $38.00/$42.00 — VollS). nemn VOL 
Annual Review of PHYSIOLOGY 

Vols, 19-47 (1957-19B5).......+..- <.. e o . $27.00/$30.00 

Vols. 48-49 (I98G-1087) soe sce ceca $32.00/$35.00 

Vol. 50 (IIBB) eeina taen bees $36.00/$40.00 

Vol, 51 (avail. March 1989)........... $36.00/$40.00 Vol{s)._____.... Vol. 
Annual Review of PHYTOPATHOLOGY 

Vals, 3-23 (1965-1985)... . ccc eee e wen ee $27.00/$30.00 

Vols. 24-25 (1986-1987)... .. cece eee e eee $31.00/$34.00 

Vol. 26 (1988) ..sssssseoossvuns , . . $36.00/$40.00 

Vol. 27 (avail. Sept. 1989) ..........+-$36,00/$40.00 Vol{s) Vol 
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Annual Review of PLANT PHYSIOLOGY AND PLANT MOLECULAR BIOLOGY 


Vols. 14-23, 25-86 (1963-1972, 1974-1985)....... $27.00/$30.00 
Vols. 37-38 {1986-1987}.........22 cece ee $31.00/$34.00 
Vol. 39 (IOBA): 5 once ccas eae .. +. . . $34.00/$38.00 
Vol. 40 favail. June 1989} ............ $34.00/838.00 Vol{s)._. sol. 
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Continued 


